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In  this  study,  heterogeneous  bibliographic  information  resources  from  geographically  distributed 
locations  are  integrated  in  an  automated,  unified  and  controlled  way  by  using  abstract  data  types  through 
the  Message-Object  Model  as  defined  in  Smalltalk-80.  A  unit  of  modularitytpalf  a  "class"  is  developed  that 
defines  operations  to  process  the  data  structures  encapsulated  in  the  class.  The  classes  focus  on  processing 
bibliographic  citations  obtained  from  heterogeneous  on-line  bibliographic  databases  into  a  meta-form  with 
the  goal  of  developing  information  consistency  to  simplify  further  information  analysis.  Classes  developed 
for  the  bibliographic  citation  application  can  speed  program  development  because  the  data  abstractions  can 
be  used  in  processing  generic  information  such  as  dates  regardless  of  the  bibliographic  database  source. 
Prototype  classes  are  developed  to  show  the  ease  in  encapsulating  data  structures  and  behaviors  for  the 
bibliographic  citation  application.  Data  abstractions  provides  a  powerful  integration  technique  that  allow 
the  designer  to  work  with  bibliographic  citation  objects  without  being  encumbered  with  the  details  of 
implementation. 
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In  recent  years,  the  development  of  abstraction  mechanisms  in  languages  has  focused  on  abstract 
data  types  to  "manage  complexity  by  emphasizing  what  is  significant  to  the  user  and  suppressing  what  is 
not"[Sha84].  This  has  lead  to  modem  programming  languages  such  as  Smalltalk-80,  Flavors,  and  Ada. 
Software  methodologies  have  been  developed  to  address  engineering  concerns  in  requirements, 
specificadon,  design,  implementadon,  correctness,  and  reliability  to  reduce  cost  during  the  software 
development  and  maintenance  phases.  The  use  of  abstracdons  to  logically  reduce  the  complexity  of  the 
task  is  aided  by  modern  language  mechanisms  in  that  they  provide  the  language  constructs  to  encapsulate  a 
logical  data  type  and  the  operadons  associated  with  it  The  language  constructs  of  "Classes"  or  "Flavors" 
help  in  the  abstracdon  process.  This  project  is  based  the  use  of  abstracdons  to  obtain  data  consistency  in 
heterogeneous  databases.  Our  specific  implementadon  was  applied  to  bibliographic  information.  Similar 
techniques  may  be  appliea  to  other  types  of  databases,  as  described  in  Chapter  7. 

Bibliographic  citadon  databases  from  heterogeneous  informadon  resources  are  used  widely  in 
research  and  development  work.  These  databases  are  often  accessed  to  do  a  subject  search  or  to  prepare  a 
bibliography.  The  citadons  contained  in  the  bibliographic  databases  may  be  large  in  number  and  collected 
over  a  long  period  of  time.  This  process  was  done  manually  before  computers  became  readily  available, 
and  was  tedious  and  error  prone.  Today  computers  are  used  widely  for  this  task.  Modem  computer 
automated  tools  have  been  developed  to  assist  in  such  bibliographic  processing  and  are  continually  being 
enhanced[Gol85]. 

A  research  task  may  consist  of  accessing  several  bibliographic  systems  such  as  DIALOG,  INSPEC, 


NASA/RECON,  DOE/RECON,  or  EMDD/DROLS. 


The  respective  retrieved  citation  lists  are  down-loaded  into  a  user  file  for  post-processing  analysis.  Each 
database  has  its  own  form  because  of  independent  development  programs  and  a  lack  of  generally  accepted 
standards.  Hence  post-processing  analysis  on  a  database  citation  file  requires  an  individualized  software 
processing  package  for  each  citation  database.  Somedmes  the  user’s  files  are  merged  if  software  is 
available  to  translate  the  files  into  a  common  format. 

To  analyze  data  from  the  down-loaded  and  merged  files  requires  data  consistency.  Hence  a 
prototype  has  been  developed  to  provide  the  tool  to  make  heterogeneous  bibliographic  citadon  databases 
consistent.  For  example,  the  search  for  citadons  within  a  range  of  dates  is  encumbered  by  the  problem  that 
dates  may  be  represented  in  different  formats  in  different  databases.  Searches  on  author  names  are  also  a 
problem  if  different  databases  enter  first,  middle  or  last  names  in  varying  formats.  The  goal  is  to  have  one 
tool  process  the  heterogeneous  bibliographic  citadons  into  a  standard  form  to  provide  the  basis  for 
convenient  data  analysis. 


Significant  improvements  are  made  by  conceptualizing  the  problem  of  data  consistency  by 
abstracdons  in  terms  of  Smalltalk  classes.  Since  the  information  types  in  citations  are  broadly  similar, 
classes  can  be  developed  for  each  type  of  information  such  as  "date"  or  "title".  Careful  specification  of  the 
classes  can  simplify  the  programmers  task  since  interfaces  will  be  defined,  and  data  and  their  behaviors 
will  be  understood.  Another  improvement  occurs  when  future  enhancements  are  built  on  the  classes 


already  developed  and  serve  to  reduce  the  amount  of  new  software  needed. 


This  study  shows  the  ease  in  developing  prototype  classes  for  integrating  heterogeneous 


bibliographic  citation  databases  and  suggests  the  basis  for  the  development  of  additional  classes  required 


for  the  complete  application.  The  modularity  of  software,  the  inheritance  by  classes,  the  encapsulation  of 


data  structures  and  operations,  and  the  use  of  dynamic  binding  reduce  the  task  of  the  software  designer  and 


developer.  Hence  the  Object-Message  abstraction  narrows  the  gap  between  the  concepts  and  analysis  of  the 


problem  and  the  notation  used  in  the  computer  software  to  solve  the  problem. 


In  the  following  chapters,  we  discuss  the  background,  motivation,  and  development  of  abstract  data 


types  via  SmaIItalk-80  classes  to  solve  the  problem  of  data  consistency  in  heterogeneous  bibliographic 


citation  databases. 


Chapter  2  discusses  previous  methods  used.  Chapter  3  gives  the  characteristics  of  the  Message-Object 
Model.  Chapter  4  discusses  the  physical  hardware  and  software  methods  used  to  create  the  Objective-C 
classes  for  the  prototype.  Chapter  5  discusses  the  specifics  of  the  prototype  implementation,  Chapter  6 
discusses  the  results  of  the  prototype  implementation,  and  the  last  chapter  discusses  future  directions. 


Chapter  2 


Previous  Methods  for  Processing  Heterogeneous 
Bibliographic  Information 


This  chapter  gives  some  background  information  on  bibliographic  citation  databases  and  discusses 
previous  methods  for  processing  the  information. 

2.1  Why  Use  Heterogeneous  Bibliographic  Information  Resources? 

Hall  and  Brown  provide  a  statistical  study  of  the  on-line  bibliographic  databases  that  is  the  basis  of 
this  section[Hal83].  Online  databases  have  been  available  since  the  1960s  but  have  mostly  been  in-house. 
Since  1972,  there  has  been  a  rapid  growth  of  publicly  accessible  databases. 

Table  I 

Number  of  Bibliographic  References  Available  Online 
in  millions 


1968 

1972 

1976 

1980 

1982 

1/4 

3 

20 

58 

HHw 

The  current  rate  of  addition  is  8.7  million  references  per  year.  With  duplication  accounted  for,  the 
estimate  is  50  million  singular  references  available  for  use  and  six  million  additions  to  the  reference  pool 


Parallel  to  the  four-fold  increase  from  1976  to  1982,  the  growth  in  on-line  use  is  estimated  to  be  six 


fold  as  seen  in  Table  II. 


Table  U 

Bibliographic  Searches  on  Public  Systems  in  U.S.A.  and  Canada 
in  millions 


1975 

1977 

1979 

1981 

1 

2 

4 

6 

There  are  four  particularly  predominant  database  services.  They  are  listed  in  Table  III.  Each  suf^lier 
strives  for  uncommon  databases  in  their  service.  Nearly  20  percent  of  the  important  databases  are  not 
available  from  the  four  services. 


Table  III 

Unique  and  Common  Databases  avaUable  from  major  suppliers 


Supplier 

BRS 

DIALOG 

IRS 

ORBIT 

Unique 

8 

39 

5 

24 

Common 

28 

56 

27 

28 

Total  Clumber 

36 

95 

32 

52 

Total  Percent 

21 

55 

18 

30 

The  vast  repertoire  of  information  makes  the  access  to  heterogeneous  bibliographic  information  an 
important  resource  to  a  researcher.  From  Table  III,  we  see  that  a  password  to  DIALOG  gives  access  to 
fifty-five  percent  of  the  databases.  An  additional  password  to  ORBIT  gives  a  total  access  to  seventy 
percent  of  the  databases. 
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Up  to  1984,  more  than  24S3  citation  and  numeric  data  files  were  available  from  362  on-line 
information  vendors[Cua84].  Scientific  disciplines  are  continually  adding  to  the  published  set  of  abstracts 
and  citations.  Most  on-line  bibliographic  information  is  still  obtained  in  printed  form  after  an  on-line 
search.  The  vast  amount  of  informadon  needs  a  tool  with  a  unified  view  to  extract  significant  scientific 
and  technological  intelligence. 


22  Description  of  Bibliographic  Citations 

To  understand  the  problems  involved  in  heterogeneous  bibliographic  citations  a  simple  MEDLINE 
citation  is  described  as  it  is  mounted  on  BRS.  Only  six  fields  were  selectively  down-loaded. 


Sample  Bibliographic  Citation 


[AU]  Bowry-T-R.  Oywang-J.  Lumba-M. 


[IN]  Department  of  Human  Pathology,  Faculty  of  Medicine,  University  of  Nairobi,  Kenya. 


[TI]  HBV  infection:  prevalence  of  core  antibody  and  other  markers  in  urban  based  black  school 
children  in  Kenya. 


[SO]  Ann-Trop-Paediatr.  1983  Dec.  3(4).  P  197-200. 


[LG]  EN.. 


[IS]  0272-4939 


& 


Sirs'll  5  S  5*95 


The  AU  represents  author,  with  hyphens  separating  initials.  LG  tepresents  language  and  IS  is  the 
accession  number  for  the  citation  in  the  particular  database.  IN  represents  the  institutional  affiliation  of  the 
author,  TI  is  the  title,  and  SO  is  the  source.  The  same  bibliographic  citation  from  a  different  database 
source  may  be  formatted  in  a  completely  different  way.  Inconsistency  in  the  detail  held  hinders 
information  analysis[Gol8Sl. 

2  J  Processing  of  Heterogeneous  Bibliographic  Irformation 

With  the  appropriate  administrative  requirements  fulfilled,  a  user  can  down-load  bibliographic 
records  from  a  variety  of  on-line  services  such  as  BRS  or  DIALOG.  Typically,  an  off-line  printing 
follows  a  search,  and  is  arranged  in  reversed  chronological  order.  The  need  for  computer  based  editing 
tools  is  a  natural  consequence.  Rather  than  obtaining  the  down-loaded  information  in  stacks  of  printout, 
the  bibliogr^hic  citations  ate  down-loaded  to  a  disk  file  so  a  computer  can  be  used  ftM*  automated 
processing  of  the  information.  We  observe  two  problems  that  exist  in  local  processing  of  the  file.  The  file 
must  be  translated  into  a  common  form  to  handle  the  different  database  formats  for  data  tags  and  to  handle 
the  inconsistencies  in  the  detail  information  associated  with  each  dau  tag. 

Tools  to  develop  data  consistency  are  available  in  most  modem  database  management  systems. 
Information  consistency  within  a  specific  bibliographic  database  may  also  be  augmented  by  locally 
developed  software  and  procedures.  The  database  administrator  can  use  software  tools  to  constrain  data 
entry  to  meet  certain  requirements.  The  user  may  be  required  to  enter  data  strictly  in  integer  format  within 
a  certain  range  of  values  or  character  format  within  a  certain  string  length.  Furthermore,  the  user  may  be 
required  to  enter  strings  that  are  pre-defined  in  a  dictionary  for  that  attribute,  such  as  one  of  eight 
acceptable  colors.  We  can  see  at  this  point  that  information  may  be  entered  correctly  into  a  particular 
database  in  formats  that  are  singularly  defined  by  the  local  database  administrator.  However,  there  may  be 
inconsistent  formats  among  the  heterogeneous  bibliographic  databases  because  of  a  lack  of  standards  and 
autonomous  database  development  and  administration.  For  example,  dates  can  be  constrained  in  a  local 
database  to  be  either  May  1,  1985  or  1  May  1985  format. 


There  may  be  additional  differences  in  upper/lower  cases,  abbreviations,  spaces,  or  punctuation.  These 
inconsistencies  hinder  the  automated  processing  of  bibliographic  citations  in  the  down-loaded  disk  file. 
Hence  we  find  in  processing  a  search  based  on  date  ranges,  software  must  be  written  to  handle  the  date 
discrepancies,  or  the  search  will  be  incomplete.  Author  names  also  introduce  problems  because  R.  L. 
Smith,  Richard  L.  Smith,  and  Richard  Lee  Smith  are  the  names  of  the  same  author.  If  one  desires  a  list  of 
articles  written  by  R  chard  Lee  Smith  after  a  certain  date,  the  tabulation  would  be  inaccurate. 

A  recent  study  on  popular  'front-end  systems’  available  on  the  market  for  processing  bibliographic 
citations  shows  that  the  user  has  a  limited  choice  of  features  such  as  down-loading  and  file  creation.(i.e., 
SciMate,  InSearch,  CONlT)[Bol84].  Software  is  not  available  to  address  the  problem  of  data  consistency 
among  heterogeneous  databases. 

Goldstein  and  Prettyman  have  developed  software  to  process  down-loaded  citations  with  the  goal  of 
incorporating  a  specified  reference  format  into  manuscripts.  In  their  work  they  encounter  the  typical 
problems  of  processing  heterogeneous  bibliographic  citations. 


They  propose  transforming  each  citation  into  the  following  canonical  format. 


Field  # 

Data  Element 

Tag 

1 

TYPE 

TY 

2 

DATABASE 

DB 

3 

TITLE 

TI 

4 

AUTHOR 

AU 

5 

SOURCE 

SO 

6 

INSTITUTION 

IN 

7 

NO.  &  TYPE  MTG 

NO 

8 

MEETING  TITLE 

TM 

9 

VOLUME  NO. 

VL 

10 

ISSUE 

IS 

11 

MONTH  (JOURNAL) 

MO 

12 

DAY  (JOURNAL) 

DY 

13 

YEAR  (JOURNAL) 

YR 

14 

MONTH  (MEETING) 

MM 

15 

DAY  (MEETING) 

DM 

16 

YEAR  (MEETING) 

YM 

17 

PAGES 

PG 

18 

TOTAL  PAGES 

TP 

19 

PUBLISHER 

PU 

20 

PUBL.CITY 

PT 

21 

PUBL.  STATE 

PS 

22 

PUBL.  COUNTRY 

PC 

23 

PUBLICATION  YR 

PY 

24 

MTG.CITY 

MT 

25 

MTG.STATE 

MS 

26 

MTG.COUNTRY 

MC 

27 

REPORT  NO. 

RN 

28 

RETRIEVAL  NO. 

RG 

29 

ISSN  NO. 

SN 

30 

PART  NUMBER 

PN 

31 

CODEN 

CD 

32 

NOTES 

NT 

33 

EDITOR  TYPE 

ED 

34 

AVAILABILITY 

AV 

35 

COPYRIGHT  YEAR 

CY 

36 

PUBL.AUTHOR 

AA 

Steps  for  pre-processing  records  down-loaded  from  heterogeneous  databases  into  separate  local  files 


m 


m 


'fv 

m 


translate  field  labels  in  all  files  to  a  common  set; 


include  fields  for,  and  add  database  and  retrieval  system  names  to  all  records; 


merge  all  records  into  one  file; 


reorder  the  records  into  a  format  that  is  optimized  for  further  processing; 


determine  and  add  the  type  of  publication; 


standardize  the  format  of  the  author’s  name. 


The  parsing  stage  is  to  separate  the  complex  source  field  into  discrete  information.  Further  details 


are  found  in  Chapter  3. 


Post-processing  is  to  further  format  the  information  for  consistency  in  the  end-product  application 


program.  The  end-product  could  be  a  statistical  analysis  based  on  certain  keywords  or  a  bibliography  for  a 


publication. 


The  post-processing  tasks  are: 


conversion  for  case  consistency; 


standardize  journal  titles; 


correct  inconsistencies  in  format; 


expand  abbreviated  titles; 


add  missing  data; 


make  linkages  between  articles  and  proceedings;  chapter  and  citations. 


The  Goldstein  and  Prettyman  work  involves  knowing  the  database  source  and  then  writing  specific 


software  for  that  bibliographic  database  source.  Their  proposal  for  a  canonical  form  for  bibliographic 


mm 


citation  databases  is  an  attempt  to  develop  standardization  regardless  of  the  bibliographic  citation  sources. 

A  significant  amount  of  work  has  been  done  in  the  processing  of  heterogeneous  bibliographic 
citation  databases  by  the  Technology  Information  System(TIS)  of  the  Lawrence  Livermore  National 
Laboratory(LLNL).  They  have  been  working  on  technology  transfer  through  computer  networks  located 
nationally  and  abroad  since  1975  and  have  developed  the  Integrated  Information  System  (IIS)  that  manages 
information  and  resources  on  the  TIS  system.  IIS  supports  the  down-loading  and  analysis  of  bibliographic 
citations  from  heterogeneous  database  services.  A  major  goal  is  to  provide  the  capability  to  extract 
scientific  and  technological  intelligence  from  the  information  contained  in  these  databases.  To  accomplish 
this,  software  has  been  developed  to  process  bibliographic  citations  from  the  federal  information  centers  of 
the  Department  of  Eneigy  (DOE),  the  Department  of  Defense(DOD),  and  the  National  Aeronautics  and 
Space  Administration(NASA)  as  well  as  the  three  major  U.  S.  commercial  services  —  Lockheed- 
DIALOG,  SDC-ORBIT,  and  BRS.  [Bol84] 

The  Integrated  Information  System  (IIS)  software  package  is  menu-driven  and  provides  for  the 
following  bibliographic  database  options: 


(TRANSLATE] 

[MERGE] 

[STAT] 

[ANALYZE] 

[REVIEW] 

[CONCORD] 

[PERMUTE] 

[CROSS] 

[PLOT] 


translates  citations  to  a  standard  format 

combines  translated  files  from  different  sources  into  one  file 

creates  a  statistical  profile  of  citations 

analyzes  bibliographical  text 

permits  on-line  evaluation  of  citations  for  relevancy. 

creates  indexes  by  author,  subject,  descriptors,  etc. 

issues  multi-term  statistics  of  the  text  in  selected  data  fields 

cross-correlates  the  contents  of  data  fields 

shows  the  number  of  citations  by  year  in  a  graph 


[DISPLAY]  displays  the  contents  of  any  file  on  the  CRT  screen 


% 


*1 


TRANSLATE,  MERGE,  DISPLAY  and  REVIEW  do  the  pre-processing  steps  1-4  mentioned  by 


Goldstein  and  Prettyman.  ANALYZE,  CONCORD,  PERMUTE,  CROSS,  PLOT,  and  STAT  allow  the 


user  to  produce  some  trend  analysis  from  tlie  bibliographic  citations  that  have  gone  through  the 


preprocessing  steps. 


Cunently,  the  trend  analysis  is  not  entirely  accurate  since  the  detail  informadon  is  not  endrely 


consistent.  A  closer  examination  of  the  pre-processed  flies  shows  dates  in  the  following  form: 


1  May  198S 


May  1, 1985 


5/1/1985 


5/1/85 


May  1985 


May,  1985 


The  job  of  producing  a  file  that  is  consistent  is  dme-consuming  and  difficult;  duplicate  bibliographic 


citations  are  not  easily  detected.  A  particular  citadon  usually  contains  only  a  subset  from  the  set  of  data 


tags  and  different  databases  may  enter  certain  detail  informadon  under  different  data  tags.  An  example  is 


the  the  following: 


<DATABASE  SOURCE>  DIALOG  NTIS  FILE  6 


<TITLE>  Online  Directory  of  Databases  for  Material  Properdes 


<DATABASE  SOURCE>  DOE/recon 


<TITLE(MONO)>  Online  directory  of  databases  for  material  properties 


I  vv 


VV 


The  purpose  of  this  project  is  to  further  extend  the  consistency  of  the  detail  information  found  in  a 
merged  file  that  is  the  result  of  down-loading  heterogenous  bibliographic  citation  databases.  It  is  through 
the  development  of  abstract  data  type  Smalltallc-80  classes  that  similar  types  of  information  can  be 
standardized,  regardless  of  source.  The  standardization  of  dates  and  authors  and  titles  include  accounting 
for  spaces,  punctuation,  capitalization,  and  ordering. 
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Chapter  3 


The  Message-Object  Model 


We  first  establish  the  foundation  for  using  abstractions  in  software  development.  Next,  we  discuss 
the  motivation  for  using  abstract  data  types  via  Smalltalk  80  classes  to  solve  the  data  consistency  problem 
in  heterogeneous  bibliographic  citation  databases. 

3.1  Abstraction  Mechanisms  in  Modern  Programming 

Recent  work  in  programming  methodology  has  led  to  the  recognition  of  three  kinds  of  abstractions: 
control,  procedural  and  data.  A  large  effort  has  been  expended  in  developing  a  modem  programming 
methodology  so  software  is  constructed  that  is  easy  to  understand,  modify,  maintain,  and  is  reliable.  The 
quality  of  a  program  depends  on  the  programming  methodology  used.  The  effective  utilization  of  the 
methodology  is  strongly  dependent  on  the  programming  language  selected  for  the  software  development. 
Certain  concepts  in  the  methodology  may  be  difficult  to  put  into  place  if  the  language  does  not  provide  the 
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constructs  that  make  the  process  automatic.  The  language  does  influence  the  way  a  programmer  thinks  and 
formulates  ideas.  A  good  match  of  the  methodology  and  the  language  enhances  the  likehood  that  the 
methodology  will  be  followed.  An  example  would  be  to  attempt  to  introduce  the  concept  of  block  structure 
using  Fortran  66.  A  better  choice  would  be  Pascal  because  the  language  supports  block  structured 
constructs.  While  it  is  true  that  software  can  be  written  in  Fortran  to  simulate  the  methodology,  the  job  is 
unnecessarily  enlarged  for  the  software  implementer(Lis74][Lis77]. 


3.1.1  Software  Abstractions 


What  do  we  mean  by  software  abstractions?  We  mean  that  the  abstraction  isolates  the  use  from  the 
implementation.  That  is  to  say,  that  the  abstraction  can  be  used  without  the  knowledge  of  how  the 
implementation  was  carried  out,  and  the  implementation  can  be  done  without  the  knowledge  of  how  it  is  to 
be  used[Lis77].  In  the  early  1950s,  we  see  the  application  of  abstractions  in  terms  of  assembly  language 
rather  than  machine  language  in  terms  of  octal  numbers.  Three  letter  acronyms  were  used  instead  of  an 
octal  number  that  represented  the  operator.  Operands  were  designated  by  symbolic  labels  rather  than 
absolute  addresses  in  memory.  Early  languages  supported  built-in  data  types  like  integer  and  reals.  One 
did  not  think  in  terms  of  binary  bits  in  a  computer  word  at  a  certain  physical  location  in  memory.  Later 
type  checking  aided  in  appropriate  default  conversions  when  a  real  number  was  added  to  an  integer. 
Hence,  the  programmer  was  relieved  of  low  level  detail.  Procedural  and  control  abstractions  were 
dominant  Sorting  procedures  and  square  root  functions  could  be  specified  without  requiring  knowledge  of 
the  implementation,  and  the  implementation  could  be  done  without  knowledge  of  how  they  were  to  be 
used.  Later,  control  abstractions  such  as  do-loops  were  made  available  so  the  concept  of  iteration  was 
abstracted  by  the  language  construct.  Abstractions  were  treated  as  a  program  organization  technique. 
Programmers  could  define  macros  and  define  new  data  types  required  by  a  specific  problem.  We  note  that 
data  structures  such  as  stacks  and  linked  lists  were  first  treated  systematically  in  1968.  The  idea  of 


studying  and  formalizing  programming  activity  dates  back  to  this  time[Sha84]. 


What  was  recognized  in  early  1970  was  that  programs  were  difficult  to  understand  and  maintain. 


With  the  infamous  gotos  that  spanned  a  large  number  of  software  lines  indiscriminately,  the  term  "spaghetti 


code"  evolved  and  was  a  familiar  occurrence  among  progranuners.  Locality  was  advocated  in  terms  of  if- 


then-else  or  do-while  control  constructs.  For  a  while,  extensible  languages  were  promoted  because  they 


allowed  the  programmer  to  add  new  control  constructs  and  data  types  to  the  base  language  in  an  attempt  to 


add  clarity  to  the  program  and  make  the  programmer’s  tasks  easier.  This  idea  became  unpopular  since  it 


was  difficult  to  keep  independent  extensions  compatible,  to  organize  the  definitions  so  related  information 


were  grouped  together,  and  to  find  a  technique  to  describe  the  extensions  accurately. 


The  need  for  more  accurate  specifications  was  recognized  since  programmers  typically  relied  on 
procedure  headers  and  parameter  lists  with  accompanying  text  to  define  the  procedural  abstraction.  This 
specification  technique  depended  on  individual  styles,  and  some  were  well  written  and  accurate,  while 
others  were  vague  or  out  of  date. 

3.1.2  Structured  Programming  Methodology 

The  structured  programming  methodology  was  developed  in  the  1970s  to  address  these  problems:  to 
make  programs  reliable,  easy  to  understand,  develop  and  maintain.  It  detailed  phases  in  software 
development,  specified  tools  needed  to  assist  in  tlie  process,  and  established  tests  and  criteria  for  program 
correctness.  Program  development  was  to  evolve  top-down  using  the  idea  of  abstractions.  First  the 
statement  of  the  problem  was  presented  and  then  successive  refinements  were  made  until  the  problem  was 
finally  solved.  The  idea  is  to  start  with  a  high  level  abstraction  and  then  progress  by  problem 
decomposition  to  recognizing  subsidiary  abstractions.  This  is  where  we  find  modem  programming 
languages  as  CLU,  Alphard,  ADA,  Concurrent  Pascal,  Euclid,  Gypsy,  Mesa  and  Modula  being  developed 
to  support  the  structured  programming  meihodology[Sha84]. 

3.1.3  Abstract  Data  Types 

Procedural  and  control  abstractions  were  available  but  the  idea  of  abstract  data  types  needed 
promotion.  Through  abstract  data  types,  the  idea  of  locality  would  hence  be  further  extended,  making 
programs  easier  to  design,  implement,  and  maintain.  Specifications  would  be  easier  to  write  because  of  the 
encapsulation  of  the  data  structures.  Data  behaviors  could  be  defined  only  within  the  abstract  data  type. 
The  requirements  of  a  language  supporting  data  abstractions  developed.  Linguistic  constructs  were  needed 
that  implemented  data  abstractions  as  a  unit  in  terms  of  data  representations  and  operations  on  the  data. 
The  construct  would  provide  a  mechanism  by  which  the  language  would  limit  access  to  the  representation 
except  by  the  operations  defined.  Smalltalk  is  such  a  language  with  abstract  data  types  in  terms  of  classes. 
CLU  has  clusters;  Ada  has  packages:  Flav  has  flavors. 
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A  basic  concept  is  that  the  operations  defined  for  a  class  must  include  all  operations  needed  in  handling  the 
data  structure.  Usually  the  operations  include  create,  modify,  and  access  operations.  The  desirability  of 
classes  is  that  the  language  takes  care  of  all  the  interface  specifications,  the  names  for  instantiations  of  the 
classes,  the  assignment,  argument  passing  and  type  correctness. 

Essential  to  abstract  data  types  is  the  primitive  library  that  is  provided  with  the  compiler.  Here 
typical  abstract  data  types  as  arrays,  AVL  trees,  bags,  and  dictionaries  are  provided  from  which  the 
programmer  can  develop  new  abstract  data  types  particular  to  the  application.  Inheritance  is  important  in 
that  new  abstract  data  types  that  are  defined  are  based  on  the  properties  defined  in  a  primitive  abstract  data 
type.  As  a  matter  of  fact  the  abstract  data  types  are  usually  arranged  in  a  hierarchical  tree  so  that  an 
abstract  data  type  inherits  all  the  properties  defined  between  it  and  the  root  of  the  tree. 

Abstract  data  types  are  the  means  by  which  the  human  can  transform  problem-domain  concepts  into 
the  computer-domain  model.  In  other  words,  the  separation  of  specification  and  implementation  is  the 
desired  result.  The  goal  is  that  program  correctness  at  the  abstract  level  can  be  ascertained  before  the 
implementation.  The  phrases  "abstract  data  types"  and  "object-oriented  programming  have  been  used  in 
various  contexts,  from  Simula  and  its  derivatives  such  as  Ada  to  powerful  data  description  languages  used 
in  knowledge  representation.  The  meaning  we  apply  is  in  the  Smalltalk-80  context.  [Cox84] 

3.2  Object-Oriented  Programming 

Object-Oriented  Programming  replaces  the  operator-operand  concepts  that  are  used  in  the  traditional 
computer-domain  model.  The  idea  is  to  introduce  a  coordination  tool  that  supports  change,  reusability,  and 
enhancements.  The  goal  is  to  transfer  work  from  the  human  to  the  machine  and  to  enhance  consistency 
from  the  human  viewpoint. 

Two  major  concepts  of  Object-Oriented  programming  are  encapsulation  and  inheritance. 
Encapsulation  is  an  aid  in  using  the  system  and  isolates  the  objects  from  the  environment  except  through  a 
carefully  controlled  interface.  Inheritance  is  a  aid  to  building  the  system.  New  classes  are  defined  by  first 
inheriting  the  data  and  behaviors  of  older  generic  clas.ses,  then  specifying  only  how  the  new  ones  differs. 
The  idea  is  to  define  the  data  abstractions  so  the  programming  task  is  made  minimal. 
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Now  we  will  define  some  terms  used  in  a  Message-Object  programming  language  such  as 
Smalltalk-80.  The  terms  object,  message,  class,  instance,  and  method  are  all  defined  in  terms  of  each  other. 
We  will  relate  the  terms  to  the  Objective-C  compiler  that  is  a  derivative  of  Smalltalk-80,  and  will  clarify 
them  by  examples  in  Chapter  5. 

3.2.1  Objects 

Objects  are  virtual(computer-based)  machines.  They  have  some  data  (private  part),  a  set  of 
operations(shared  part),  and  a  run-time  mechanism  for  selecting  operations  on  the  data  that  are  activated  by 
a  message  sent  to  the  object.  They  exhibit  one  of  tlieir  behaviors  when  they  receive  a  message. 

3.2.2  Messages 

Messages  are  sent  to  objects  and  are  requests  to  obtain  a  desired  result.  The  message  contains  a 
predefined  operation(method)  to  be  done  on  the  data  structure  and  are  serviced  one  at  a  time  by  the  object 
Objects  representing  numbers  have  arithmetic  operations;  objects  representing  data  structures  as  AVL  trees 
create  an  empty  tree,  add,  delete,  modify,  or  count  elements. 

3.2.3  Classes 

A  class  represents  a  description  of  a  group  of  similar  objects.  A  class  is  the  abstract  data  type  and  an 

I 

object  is  an  instantiation  of  it  For  example  the  class  rectangle  deals  with  the  generic  group  of  rectangles, 
but  an  instance  of  class  rectangle  will  have  specific  dimensions  for  length  and  width.  Binding  is  done  at 
run-time  so  there  is  no  static  type  checking  at  compile  time.  An  example  would  be  the  class  Array  in 
Objective-C.  The  subclasses  BytArray,  IdArray,  and  IntArray  inherit  properties  from  class  Array.  Hence 
an  operation  as  printOn  defined  in  class  Array  will  work  on  any  of  the  three  subclasses  mentioned, 
although  the  data  representations  differ  in  terms  of  byte.  Id,  or  integer.  Also  a  new  subclass  defined  later 
will  also  be  handled  correctly,  and  class  Array  does  not  have  to  be  revised  to  make  considerations  for  the 
new  subclass  data  type.  This  is  how  reusability  in  data  abstractions  becomes  a  major  asset  in  software 

development 

i 
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A  method  is  a  description  of  how  to  do  an  operation  and  is  specific  to  the  class  in  which  it  has  been 
defined.  It  resembles  procedures  and  could  use  class  variables  as  parameters.  Methods  are  written  in  a 
high  level  language  like  SmalltaIk-80,  Lisp,  or  C.  The  set  of  methods  should  include  all  the  operations 
needed  to  work  with  the  encapsulated  data,  either  via  inheritance  or  definition  within  the  class. 

3.3.  Benefits  of  Object-Oriented  Software 

One  basic  caveat  of  object-oriented  software  is  the  concept  of  reliable  reusable  code.  As  a  matter  of 
fact  the  classes  are  called  IC’s  from  the  engineering  concept  of  integrated  circuits.  To  start  with,  one  uses 
a  set  of  basic  classes  that  form  the  root  of  the  inheritance  tree  that  can  be  systematically  augmented  by 
defining  new  classes. 

To  further  understand  the  problem  we  are  addressing,  let  us  look  at  the  Goldstein  and 
PrettymantGol85]  analysis  of  bibliographic  sources  from  four  different  bibliographic  citation  databases: 
MEDLINE,  INSPEC,  ISIC,  and  COMPENDEX. 

[MEDLINE]  Ann-Trop-Paediatr.  1983  Dec.  17-18.  3(4).  P  197-2(X). 

[INSPEQ  LASER  FOCUS  (USA).  VOL.19,  N0.8.  61-6. 

[ISIC]  COMPUTER  9(3):1 1-12 

[COMPENDEX] 

a)  Electronics  v  56  n  7  Apr  7  1983  p  155-157. 

b)  IEEE  Trans  Magn  v  Mag- 14  n  5  Sep  1978,  INTERMAG  (Int  Magn)  Conf, 

Florence,  Italy,  May  9-12  1978  p  964-965. 

The  parsing  of  the  citation  source  is  a  major  task  in  arriving  at  the  information  in  the  canonical  form 
suggested.  It  cannot  be  automated  fully,  and  is  iterative  due  to  inconsistency  in  the  data,  addition  of  new 
words  to  the  authority  dictionaries,  and  new  valid  acronyms,  entries  and  words. 
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Goldstein  and  Prettyman  give  an  accompanying  parsing  structure  for  each  of  the  above  citation 


[MEDLINE]  [tide].*[year]*[month].*[day(s)].*[vol]([issue]).*P'''[pages]. 


[INSPEC]  [title]([country]).'*'VOL.[volume],*NO.[number].*[pages]. 


[ISIC] 


[title]  ♦  [volume]([issue]):  [pages] 


[COMPENDEX] 


a)  [title]*v*[volume]*n*[issue]*[month]*[day(s)]*[year]*p*[pages] 


b)  [tide]  *v*[volume]*n*  [issue]  *  [month]  *  [days]  *[year],* 


[conf.  name],*[city],*[country],*[month]’'‘[day(s)]* 


[year]*p*[pages]. 


One  observes  there  are  classes  that  are  common  to  the  different  sources.  As  a  matter  of  fact,  the 
tasks  involved  in  processing  for  data  consistency  of  tide,  volume,  and  date,  are  similar  regardless  of  the 
database  origin  or  the  citation  source.  There  may  be  variadons  in  case,  punctuation,  abbreviadons,  and/or 
format.  We  see  date  specified  as  Sep  1978  or  May  9-12  1978  in  the  COMPENDEX  sources.  The  goal  of 
this  project  is  to  develop  some  prototype  classes  that  augment  the  set  of  generic  classes  to  provide  the 
abstract  data  types  needed  to  produce  data  consistency  in  citadons  from  heterogeneous  bibliographic 


databases. 


1 
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Chapter  4: 


Prototype  Development  Environment 


This  chapter  describes  the  physical  hardware  and  software  methods  used  to  implement  the  prototype 
classes  to  process  heterogeneous  bibliographic  citation  databases  into  a  consistent  form. 

4.1  Computer  System 

The  work  was  started  on  the  LLNL  Engineering  Research  Division  (ERD)  VAX  11/780  using  the 
VMS  operating  system  since  it  was  the  only  installation  with  the  Objective-C  compiler  at  LLNL  at  the 
time.  The  parser  development  using  the  Unix  tools  LEX  and  YACC  was  done  on  the  Tektronix  6205 
workstation.  The  parser  modules  were  sent  over  the  network  to  the  VAX  to  be  compiled  by  Objective-C 
along  with  the  prototype  class  modules  to  minimize  use  of  the  resources  on  the  VAX.  With  limited  system 
resources  on  the  ERD  VAX,  the  work  was  later  completed  on  the  LLNL  Technology  Information  System 
(TIS),  which  meanwhile  acquired  the  Objective-C  compiler.  Their  VAX  1 1/780  uses  the  UNIX  operating 
system  BSD  4.2;  certain  VMS  program  lines  needed  for  compatibility  with  Objective-C  were  removed.  In 
general  the  environment  was  simpler  for  development  work  since  the  VMS  port  for  the  Objective-C  was 
still  in  progress  whereas  the  port  for  Unix  BSD  4.2  was  complete. 

4.2  Software  Development  Tools 

The  Objective-C  compiler  from  Productivity  Products  International  in  conjunction  with  the  C 
compiler  was  used  to  implement  the  Object/Message  model  prototype  for  bibliographic  citation  databases. 
The  Unix  tools  Lex  and  Yacc  were  used  to  develop  the  parser  generator,  and  the  tool  Make  aided  in 
software  development.  [PPI85] 


-21- 


4.2.1  Objective-C  Compiler 


The  Objective-C  compiler  is  based  on  the  Smalltalk-80  Message/Object  Model.  The  syntax  for 


developing  classes  in  Objective-C  resembles  die  Smalltalk-80  language  but  differs  significantly  in  that  the 


class  methods  are  defined  using  the  C-language.  The  Objective-C  compiler  is  a  preprocessor  that  produces 


C  source  that  is  then  compiled.  The  preprocessor  produces  Class  and  Phylum  files  that  are  information 


repositories  and  form  the  basis  for  inheritance  and  encapsulation  for  the  classes. 


Smalltalk-80  is  the  result  of  14  years  of  research  and  development  by  the  Software  Concepts  Group 


at  Xerox  PARC.  It  is  based  on  a  software  environment  contained  entirely  within  a  workstation  with  special 


hardware  to  improve  performance  by  orders  of  magnitude.  The  Smalltalk-80  environment  solely  uses  the 


Smalltalk-80  language  and  provides  the  software  person  with  a  repertoire  of  basic  classes.  The 


environment  includes  utilities  usually  provided  by  the  computer  operating  system,  such  as  the  text  editor. 


compiler,  and  debugger.  The  environment  makes  extensive  use  of  graphics  windows,  pull  down  menus. 


and  a  pointing  device  so  the  user  can  work  on  several  views  of  his  work  in  progress.  To  change  text  under 


software  development,  the  user  points  at  the  line,  edits  it,  issues  the  compile  command,  removes  syntax 


errors,  tests  the  software,  and  then  compiles  and  links  the  new  software  into  the  system.  All  this  is  done 


without  changing  "modes"  for  editing,  compiling,  filing  or  executing. 


The  Objective-C  compiler  is  different  in  that  it  is  one  of  the  many  tools  the  software  developer  can 


add  to  the  utilities  offered  by  the  operating  system.  It  is  available  in  the  VAX  VMS  operating  system 


environment  as  well  as  computer  systems  with  the  Unix  BSD  4.2  operating  system.  It  is  is  a  preprocessor 


to  the  C  compiler  and  adds  the  basic  Smalltalk-80  concepts  of  classes,  objects,  messages,  encapsulation  and 


inheritance.  Objective-C  is  an  object  oriented  programming  language  layered  on  top  of  C  and  allows  one 


to  use  it  in  addition  to  the  existing  software  and  hardwtu^e. 


A 


i 


PS 

Is 


I 


at 


% 


Diagram  of  Compilation  Units[PPI85] 
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Objective-C  Class  Libraries 


Included  with  the  Objective-C  compiler  package  are  the  Basic  Class  Library  and  the  Foundation 
Class  library  that  establish  the  root  of  the  hierarchy  of  reusable  classes  from  which  classes  for  the  specific 
application  are  developed.  Classes  developed  for  the  application  inherit  properties  of  classes  between  the 
root  and  itself.  The  hierarchy  of  classes  provided  with  the  Objective-C  compiler  are  presented  graphically 
in  appendix  A. 

The  Basic  Library  contains  the  classes  Nil,  Object,  Array,  IdArray  and  String.  The  root  of  the 
inheritance  hierarchy  is  class  Object  that  points  to  the  Nil  class.  Every  object  inherits  all  the  methods  and 
instance  variable  available  in  class  Object.  Class  Array  is  detailed  to  give  an  idea  of  the  methods  this  class 
supports.  Array  is  a  superclass  of  several  classes  that  support  indexed  instance  variables.  It  has  an 
instance  variable  capacity  that  records  the  units  of  elements  of  the  array.  Methods  are  defined  for  instance 
creation  with  n-elements  that  may  be  initialized  from  an  argument  list  or  not.  Methods  are  also  defined  for 
copying,  inquiring  on  capacity,  printing  to  an  I/O  device,  comparing  and  hashing,  and  notifying  on  bounds 
violations. 

The  Foundation  Library  contains  the  classes  Assoc,  AVLDict,  AVLTree,  Bag,  BytArray,  CItn, 
Dictionary,  IntArray,  Cltn,  Dictionary,  IntArray,  OrdCltn,  Point,  Rectangle,  Sequence,  Sets,  Stack  and 
Unknown. 
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The  implementer  of  an  Object/Message  application  must  be  familiar  with  the  available  classes  to 
appropriately  use  the  inheritance  properties  inherent  in  the  class  hierarchy.  In  the  prototype 
implementation,  the  class  Object  was  used.  In  the  discussion  of  future  work  in  Chapter  7,  the  development 
of  other  classes  are  described  to  support  the  task  of  creating  consistency  in  the  heterogeneous  bibliographic 
citation  database. 

4.22  Unix  Tool:  Lex 

The  Unix  tool,  Lex,  is  a  program  or  module  generator.  The  basic  model  for  Lex  is  based  on  the 
theory  of  regular  expressions[Aho74].  It  generates  a  module  that  is  a  deterministic  finite  state  automaton. 
The  input  to  Lex  is  based  on  user  specified  rules  diat  are  in  the  form  of  regular  expressions.  Regular 
expressions  are  rules  for  specifying  character  strings  to  be  matched  and  include  operator  characters  to 
account  for  repetition  of  strings,  optional  or  required  occurrences  of  strings,  and  the  ordering  of  strings. 
The  user  may  associate  a  procedure  with  a  rule  so  furtlier  processing  is  done  when  a  rule  is  matched.  For 
example,  if  a  rule  in  the  form  of  a  regular  expression  expects  a  number,  the  associated  procedure  may 
verify  that  the  number  is  in  an  expected  range  and  flag  an  error  if  it  is  not  valid[Les75].  Lex  generates  the 
module  that  does  lexical  analysis  on  the  input  character  stream  consisting  of  the  detail  information 
associated  with  a  data  tag  in  a  bibliographic  citation.  The  tokens  and  optional  values  are  passed  to  the 
parser. 

4.2  J  Unix  Tool:  Yacc 

Yacc  is  a  tool  that  generates  a  program  or  module  called  the  parser.  Yacc  is  based  on  Context  Free 
Grammars  using  Backus-Naur  Form(BNF)  de.scriptors  to  specify  tlie  parser  that  accepts  the  language.  The 
formal  discussion  is  found  in  [Aho74]  and  a  user’s  manual  in  (Joh75].  The  input  to  Yacc  are  user  specified 
grammar  rules  and  optional  procedures  to  be  invoked  when  the  grammar  rule  is  recognized.  The  parser 
includes  a  call  to  the  lexical  analyzer  that  pas.ses  tokens  and  optional  values  recognized  from  the  input 


character  stream. 


The  parser  does  a  syntactic  analysis  and  docs  the  associated  actions  if  the  input  satisfies  the  grammar  rule. 
For  the  prototype  the  grammar  rules  include  all  llie  legal  variations  in  the  detail  information  for  a  data  tag 
in  a  bibliographic  citation. 

4.2.4  Unix  Tool:  Make 

The  Unix  tool  Make  is  a  software  management  tool  that  allows  dependencies  to  be  specified  by  the 
user  among  software  modules.  Changes  to  a  source  file  are  automatically  detected  and  trigger  the 
appropriate  actions  specified  in  the  dependency  rule.  For  example,  modifications  to  a  source  file  could 
trigger  recompilations  of  other  dependent  source  files. 

4.3  Summary 

The  software  prototype  was  developed  in  the  Unix  BSD  4.2  software  environment,  using  the  Unix 
tools  Lex,  Yacc,  Make  and  the  Objective-C  compiler.  The  C  compiler  was  used  to  develop  the  software. 
The  next  chapter  discusses  implementation  of  the  prototype  and  how  the  tools  are  used  in  the 


5.1  Sources  of  Data 


The  source  of  data  could  be  the  result  of  a  session  by  a  user  at  a  terminal  making  queries  of  an  on¬ 
line  system  such  as  the  Dialog  system  that  involve  the  search  of  bibliographic  citations  on  a  topic.  The 
output  is  usually  in  the  form  of  a  display  of  the  reu^ieved  citations  and  may  be  followed  by  a  more  complete 
printout  of  the  citations.  In  our  case,  the  facilities  at  the  LLNL  Technology  Information  System  (TIS)  were 
used  to  obtain  bibliographic  citations  on  the  subject  of  "Computer  Gateways  and  Networks"  from  the  six 
following  on-line  database  services:  DTIC/DROLS-TR,  DIALOG  NTIS  FILE  6.  BRS,  DOE/RECON, 
NASA/RECON,  and  SDDLIBRARY  and  INFORMATION  SCIENCE  ABSTRACT.  An  on-line  session 
with  each  particular  database  service  was  used  to  capture  the  information  into  a  local  file.  The  citations  in 
the  local  file  was  translated  into  the  TIS  standard  form  for  bibliographic  citations.  The  six  local  files  were 
then  merged  into  a  single  file  so  that  post-processing  analysis  could  be  done  on  a  single  file.  A  sample  of 
the  merged  file  is  included  in  Appendix  D. 

Each  bibliographic  citation  consists  of  an  average  of  twenty  fields  of  information.  Each  field  begins 
on  a  new  line  and  consists  of  a  data  tag  delimited  by  left  and  right  angle  brackets  (<,>)  and  ending  with  the 
descriptive  information.  In  database  terminology,  one  can  consider  the  data  tag  as  a  field  label  and  the 
descriptive  information  as  the  field  detail. 

5.2  Reformatting  the  Detail  Information  for  Con.si.stency 

On  closer  examination  of  the  bibliographic  citations  in  the  merged  file  one  finds  similar  types  of 
information  may  be  represented  in  differing  formats  if  they  come  from  different  database  sources.  There 
may  be  varying  formats  within  a  database  for  items  coming  from  different  publication  types.  For  example, 
"<DATE>  1985."  appears  in  a  BRS/National  Library  of  Medicine  Database  record,  whereas,  "<DATE> 
Aug  1984”  appears  in  a  DIALOG  NTIS  FILE  6  citation.  Another  problem  is  that  "<’nTLE>  PLURIBUS 
SATELITE  IMP  DEVELOPMENT  MOBILE  ACCESS  TERMINAL  NETWORK"  appears  in  upper-case 
in  the  DTIC/DROLS-TR  citation  but  "<T1TLE>  An  on-line  directory  of  databases  for  material  properties" 
appears  in  lower  case  except  for  the  first  word  in  the  NASA/recon  citation  database.  One  can  make  the 
observation,  however,  that  similar  "classes"  of  information  occur  in  bibliographic  citations. 
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The  task  of  reformatting  the  detail  information  for  consistency  is  a  complex  job.  The  detail 
information  from  different  database  sources  may  appear  with  a  different  data  tag.  An  example  is 
"<TITLE>  Post-processing  of  Bibliographic  Citations  from  DOE/RECON,  NASA/RECON,  and 
DOD/DROLS.  Revision  1."  from  the  DIALOG  NTIS  FILE  6  whereas  the  same  citation  in  the  DOE/recon 
database  has  "<TITLE(MONO)>  Post-processing  of  Bibliographic  Citations  from  DOE/RECON, 
NASA/RECON,  and  DOD/DROLS.  Revision  1."  The  task  of  consistency  may  include  a  cross  correlation 
of  information.  If  the  title  is  not  available  with  the  <T1TLE>  data  tag,  the  information  may  be  available 
with  the  <'nTLE(MONO)>  data  tag.  Hence  a  duplicate  may  be  detected  and  removed.  Typically,  one 
may  request  a  yearly  count  of  articles  written  on  a  subject  to  ascertain  the  emerging  importance  of  research 
in  the  area.  We  pointed  out  in  Chapter  2,  they  estimate  that  thirty-five  percent  of  the  bibliographic  citations 
are  duplicates[Hal83]  and  so  the  accounting  of  duplicates  is  important 

5.3  Program  Design  Abstractions 

Consider  the  merged  file  as  a  data  abstraction  called  in-stream,  and  the  data  abstraction  called  out- 
sueam  that  will  contain  bibliographic  citations  in  a  consistent  format.  We  will  need  procedural 
abstractions  that  indicate  when  in-stream  is  empty,  or  determine  the  next  data  tag  and  data  field  pair.  We 
can  consider  each  data  tag  and  data  field  pair  as  an  abstraction.  Hence,  we  can  arrive  at  abstract  data  types 
for  "date",  "title",  "author",  and  etc.  that  are  based  on  the  data  tags  found  in  the  merged  file. 

The  <DATE>  abstraction  is  presented  with  details  for  its  implementation.  The  bibliographic  data 
tags  such  as  <DATE>,  <AUTHOR>,  or  <T1TLE>  are  handled  as  left  context  operators.  They  trigger 
environments  that  are  very  dissimilar.  On  clo.ser  examination,  the  information  associated  with  <TITLE>  is 
considered  as  a  string,  whereas  the  information  as.sociated  with  <DATE>  is  considered  on  a  word  basis, 
where  a  word  is  any  nonempty  sequence  of  alphanumeric  characters.  Adjacent  words  may  be  separated  by 
non-alphanumeric  characters  as  space,  punctuation,  or  newline.  Hence  the  lexical  rules  and  actions  must 
be  specified  separately  for  these  two  different  environments.  In  looking  at  the  <AUTHOR>  and  <DATE> 
detail  information,  the  parser  rules  and  actions  must  be  specified  individually  also.  An  author  June  E. 
Smith  has  a  first  name  of  "June",  whereas  June  should  be  handled  as  the  sixth  month  if  it  is  a  date. 
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A  discussion  on  handling  of  left  context  sensitivity  is  described  in  the  Lex  reference[Les75].  Once  the  data 
tag  has  been  identified,  then  separate  lexical  and  parser  routines  associated  with  Lex  and  Yacc  rules  are 
called  to  process  the  information.  We  can  think  of  Lex  and  Yacc  as  procedural  abstractions  in  the 
development  of  our  prototype  class.  The  Unix  tools  Yacc  and  Lex  produce  C  modules  of  advanced 
algorithms  in  a  convenient  form  that  can  be  easily  integrated  into  the  prototype  application  program.  These 
program  generators  do  special  jobs  based  on  ufier  specifications  that  are  easy  to  update.  Yacc  produces  the 
module  "yyparse"  and  Lex  produces  the  module  "yylex".  The  user  can  insert  C  code  before,  within,  and 
after  the  call  to  either  module  to  add  a  large  amount  of  flexibility.  The  modules  generated  are  special 
purpose  and  have  excellent  performance  in  terms  of  time  and  space.  They  save  the  user  from  writing  their 
own  C  code  and  hence  frees  the  programmer  from  details  that  are  conceptualized  as  procedural 
abstractions. 

5.4  The  Prototype 

To  show  the  ease  in  creating  Objective-C  classes,  tlie  prototype  for  the  Date  class  is  described.  The 
prototype  consists  of  the  Lex  and  Yacc  specification  files,  the  Date  class  data  abstraction,  and  the  main 
program  module.  The  tutorials  on  Lex  and  Yacc  were  helpful  in  developing  the  specification  files[Bel78]. 
5.4.1  Lex  Specification  File 

The  general  format  of  Lex  input  is: 

(definitions) 

%% 

(rules) 

%% 


(user  routines) 


#include  "objc.h' 


#include  "y.tab.h" 

fdefine  MON(x)  {  yylval.lex=x;  return  MONTH;  } 
=(N,Collection,Primidve) 


The  include  file  "objc.h"  contains  most  of  the  standard  definitions  for  the  user  of  the  Objective-C 
compiler.  The  file  contains  various  C  types  such  as  STR  for  string,  SEL  for  selector,  BOOL  for  boolean, 
lOD  for  I/O  descriptor  and  SHR  for  the  shared  part  of  an  object.  The  include  file  y.tab.h  is  created  by  Yacc 
and  contains  the  tokens  used  for  communication  between  the  lexical  analyzer  and  the  parser.  The  macro 
MON(x)  is  defined  to  assign  a  value  to  yylval.lex  that  is  returned  to  the  parser.  Values  returned  by  the 
lexical  analyzer  and  associated  action  procedures  are  integers  by  default  The  rules  to  Yacc  can  define 
other  types  that  the  parser  tree  handles  so  the  stack  properly  carries  out  the  reduce  and  shifts  to  determine 
an  accepting  state  for  the  statement  being  parsed.  The  Yacc  discussion  covers  the  union  of  types  that 
account  for  the  suffix  ".lex".  The  last  statement  is  an  Objective-C  declaration  for  the  Phyla  files. 

The  rules  section  consisting  of  regular  expressions  is; 


[jj]an("."|uary)?  MON(l); 


[dD]ec("."|ember)?  MON(12); 
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{yylval.lex  =  yytext[0]  -  ’0  ;  return  DIGIT;} 


{ ;  /  *  delete  blanks  */ } 


{ return  EOL;  } 


{  return  EOL ;  } 


In  the  regular  expression  ’|jj]an("."|uary)?’,  the  months  are  allowed  in  different  forms,  i.  e.  jan,  jan., 


januaryjan,  Jan.,  or  January.  The  macro  MON(x)  is  the  action  statement  where  the  value  returned  is  an 


integer,  that  is  1  for  January,  2  for  February,  and  etc.  The  value  is  stored  in  yylvaLlex,  and  MONTH  is  the 


token  returned.  The  characters  0  through  9  are  recognized  by  the  regular  expression  [0-9]  and  the  action  is 


to  return  the  integer  value  for  the  character  representation  and  DIGIT  for  the  token.  The  regular  expression 


[  ]  deletes  blanks  since  there  is  no  action  statement  The  regular  expression  "\n"  recognizes  end-of-lines 


and  returns  the  EOL  token.  The  regular  expression  recognizes  any  other  character  and  the  action 


statement  returns  the  single  character. 


The  last  section  defines  procedure  "date(month,day,year)"  for  checking  that  the  month  is  in  the  range 


1-12,  and  the  days  for  a  month  are  correct.  The  leap  year  is  taken  into  account  on  the  days  of  a  month. 


Terse  error  warnings  are  included  that  could  be  changed  to  more  sophisticated  error  recovery  actions.  See 


Appendix  C  for  the  details.  Hence  the  lexical  analyzer  module,  yylex,  should  be  able  to  recognize  the 


tokens  in  the  eight  variations  for  "date"  that  are  tabulated  in  Chapter  2. 


5.4.2  Yacc  Specification  File 


We  now  describe  the  specification  file  that  is  input  to  Yacc  to  generate  the  module  yyparse.  The 


general  form  looks  like: 


declarations 


programs 


The  declaration  section  is: 

%{ 

#include  "objc.h” 

=  (N,  Collection,  Primitive) 
extern  id  dateObj; 

%} 

%union  { 

short  lex; 
id  obj: 

} 

%Start  prog 

%token<lex>  DIGIT  MONTH 
%token<lex>  EOL 


%type<lex>  number  year  day 
%type<obj>  DateStmt 

In  the  declaration  section  we  have  the  include  file  objc.h  and  the  phyla  declaration  that  were 
described  in  the  previous  section  on  Lex.  The  external  declaration  of  the  instantiation  of  the  Date  class, 
dateObj,  is  required  since  dateObJ  is  created  in  the  main  program.  The  union  statement  defines  the  two 
data  structures  on  the  parser  tree,  the  "lex"  integer  data  structure  and  the  Objective-C  "obj"  id  dau 
structure.  The  goal  symbol,  prog,  is  defined  by  the  %Start  statement,  and  the  legal  lexical  tokens  that  yylex 


i 


recognizes  are  DIGIT,  MONTH,  and  EOL.  Number,  year,  and  day  are  parsed  by  yyparse  and  have  the 
"lex"  integer  data  structure.  The  DateStmt  has  the  "ubj"  id  data  structure. 

The  rules  section  is: 

%% 

prog:  DateStmt  EOL  { exit  ();}  ; 

DateStmt:  MONTH  day  year 

{ 

date  ( $1,  $2,  $4 ); 

$$  =  [dateObj  mo:  $1  da:  $2  yr;  $4  ]; 

[dateObj  print] ; 

} 

I---} 

day:  number; 
year:  number; 

number  DIGIT  |  number  DIGIT  {$$  =  10  *  $1  +  $2;  }; 

The  rules  section  specifies  the  BNF  grammar  for  parsing  the  legal  forms  of  date.  The  date  procedure 
checks  that  the  number  of  days  is  within  the  correct  range  for  the  month,  with  leap  year  taken  into 
consideration. 

The  following  statement: 

$$  =  [dateObj  mo:  $1  da:  $2  yr:  $4]; 

stores  the  month,  day,  and  year  values  in  the  object,  dateObj.  The  Objective-C  message  expression  is 
contained  between  the  pair  of  square  brackets([...l).  The  message  is  sent  to  the  receiver,  dateObj.  There 
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are  three  keyword  selectors,  mo,  da,  and  year,  that  consist  of  a  string  of  characters  ending  in  a  colon 
character.  The  arguments  to  the  keyword  selectors  are  SI,  $2,  and  $4  that  are  obtained  from  the  parse  tree. 
This  is  an  invocation  of  a  method  defined  in  the  Date  class  and  is  a  behavior  in  addition  to  the  instance 
methods  that  Class  Date  inherits  from  the  Object  Class. 

[dateObj  print]; 

The  print  method  is  defined  in  the  Date  class  and  defines  a  behavior  for  printing  the  values  stored  in 
the  dateObj  object  for  month,  day  and  year.  The  user  simply  invokes  the  print  method  and  is  not 
encumbered  by  the  details  of  the  data  structures  of  month,  day,  or  year  to  print  the  information  correctly. 
In  contrast,  the  Fortran  programmer  must  know  whether  the  month,  day,  or  year  may  be  in  ascii,  octal,  or 
integer  format  to  select  the  proper  conversion  specification  in  the  "Format"  statement.  The  proper 
definition  of  the  methods  in  a  class  should  encompass  the  create,  modify,  or  reply  so  that  the  user’s 
requirements  in  working  with  the  class  object  is  complete. 

The  program  section  is  the  last  section  and  contains  an  error  diagnostic  that  prints  a  warning  to  the 
user  if  the  input  can  not  be  parsed  by  the  grammar  rules  contained  in  the  input  specification  file  for  Yacc. 
One  may  observe  at  this  point  how  terse  the  software  is  to  do  all  this  work.  The  extraneous  characters  for 
space,  /,  and  variations  in  the  date  format  are  handled  with  a  minimum  amount  of  software.  The  values  for 
month,  day,  and  year  are  stored  as  instance  variables  into  the  object,  dateObj,  through  the  method  defined 
within  the  class  Date,  and  the  print  operation  is  easily  invoked  since  the  details  are  encapsulated  as  a 
method  in  the  class  Date. 

5.43  Date  Class 

The  Date  class  is  defined  in  the  source  code  file,  "date.m".  The  declaration  section  has  the 
Objective-C  include  file,  objc.h,  and  the  Yacc  include  file,  y.tab.h.  Next,  the  declaration  for  ascii 
representations  for  month  is  included  for  the  print  method. 


The  following  statement: 


=  Date:Object  (N, Collection, Primitive) 

reflects  that  the  Date  class  inherits  properties  from  the  Object  Class,  and  the  Date  class  will  be  included  in 
the  writable  phylum  file  "N".  Also,  the  Date  class  may  use  the  classes  in  the  Objective-C  librar  s 
Collection,  and  Primitive.  The  instance  variable  are  declared  to  be  integer  for  month,  day,  and  year,  and 
are  called  mon,  da,  and  yr  respectively.  The  first  method  prefaced  with  "-mo: ..."  stores  the  values  in  the 
instance  object  The  next  method  denoted  by  "-print ..."  prints  the  date  to  the  terminal.  The  print  method 
will  test  for  the  default  values  of  -1  and  vary  the  printout.  The  three  sample  printout  forms  are: 

1  May  1985 
May  1985 
1985 

5.4.4  Main  Module 

The  main  program  contained  in  the  file,  "main.m”,  begins  with  the  include  file  for  the  C  compiler 
standard  I/O  library,  stdio.h,  and  the  Objective-C  include  file,  objc.h.  The  phyla  declaration  statement  for 
the  main  program  follows.  The  externals  are  declared  in  addition  to  the  instance  object,  dateObj.  The 
main  program  sets  the  output  to  be  the  terminal  that  is  the  Unix  standard  output  device. 

The  statement 

dateObj  =  [Date  new] ; 

creates  the  object  for  the  Date  class.  Since  the  method  "new"  is  not  defined  in  the  Date  Class,  the  method 
is  inherited  from  the  Object  Class.  The  prompt  is  printed  at  the  terminal  and  then  the  input  is  expected 
from  user  at  the  terminal  so  that  it  can  be  parsed  and  have  its  values  for  month,  day,  and  year  stored  into 
the  date  object  just  created.  The  print  method  is  then  invoked  to  verify  the  proper  values  are  stored  in 
dateObj  for  month,  day,  and  year.  The  last  two  statements  declare  the  classes  and  phyla  that  can  be  used  in 
this  application  program. 
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Chapter  6 


Summary  and  Results: 


The  intent  of  the  prototype  implementation  is  to  provide  a  programming  example  of  the  Class  data 
abstraction  mechanism  of  Objective-C  as  applied  to  the  Date  class  to  obtain  data  consistency  in  varying 
forms  of  dates  that  are  contained  in  bibliographic  citations.  Through  a  simple  example,  features  of  the 
abstraction  mechanism  in  Objective-C  have  been  presented.  The  Unix  tools,  Lex  and  Yacc  were  used  to 
develop  the  procedural  abstracdons,  yylex,  and  yyparse,  that  do  the  lexical  analysis  and  syntacdc  analysis 
on  the  varying  date  forms.  Eight  variadons  of  dates  consisting  of  month,  day  and  year  were  established  in 
the  dateObj  object  for  the  Date  class.  With  the  instance  variables  set  to  specific  values,  the  print  method 
could  be  invoked  to  take  care  of  the  task.  The  private  data  and  data  access  methods  are  encapsulated 
within  the  Date  class,  and  requires  that  the  user  communicate  through  messages  to  the  object  to  elicit  the 
behaviors  desired. 

The  Date  class  is  an  elementary  example  to  show  how  other  classes  for  the  bibliographic  citation 
database  can  be  developed  for  accomplishing  data  consistency  in  the  numerous  fields  in  a  bibliographic 
citation.  The  Date  class  can  easily  be  extended  to  included  more  methods,  categorized  as  setting,  inquiring, 
performing  arithmetic  and  printing. 

Setting; 

1.  -setmo:  aMonth  set  the  month 

2.  -setda:  aDay  set  the  day 


3.  -setyr:  a  Year 


set  the  year 


Inquiring: 


1.  -getmo:  aMonth  reply  the  month 

2.  -getda:  aDay  reply  the  day 

3.  -getyr:  aYear  reply  the  year 

Performing  Arithmetic: 

1.  -Julian  reply  the  Julian  day 

2.  -dayofyear  reply  the  nth  day  of  year 

Printing 

1.  -printmo  reply  the  month 

2.  -printdy  reply  the  day 

3.  -printyr  reply  the  year 

The  goal  is  to  develop  a  comprehensive  Date  class  to  simplify  the  task  of  constructing  reliable 
software  that  is  easy  to  understand,  modify,  and  maintain.  This  Date  class  will  be  part  of  the  Class  Library 
that  is  accessed  by  application  programmers  who  will  rely  on  the  skill  of  the  designer  who  develops  the 


abstraction.  The  classes  must  be  defined  such  that  the  behaviors  of  the  class  of  information  is  fully  defined. 
These  include  the  create,  modify  and  reply  operations.  In  the  event  that  additional  behaviors  are  necessary, 
the  concept  of  abstraction  mechanisms  in  the  programming  language  as  Objective-C  will  guarantee  that 
software  will  not  have  to  be  re-examined  or  re-written  because  of  the  change. 

We  briefly  describe  how  the  <AUTHOR>  and  <TITLE>  classes  can  be  defined  and  used  in  the 
application  for  data  consistency  in  heterogeneous  bibliographic  citation  databases.  The  main  program  is 
expanded  to  examine  the  in-stream  of  data  and  look  for  the  "<AUTHOR>"  or  "<TrTLE>''  data  tag.  This  is 
easily  done  since  the  data  tags  are  enclosed  in  the  left  and  right  angle  brackets.  The  characters  following 
the  right  angle  bracket  are  saved  in  a  buffer  until  a  left  angle  bracket  is  detected.  This  buffer  of  characters 
is  then  passed  as  data  input  to  the  parser  developed  for  the  particular  data  tag  information. 
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In  the  TITLE  data  tag  the  Lex  specification  file  will  have  the  action  statement  convert  the  text  to 
upper-case  for  consistency,  and  then  will  store  the  title  into  the  object 

yylval.obj  =  [String  str:  yytext]; 
return  STRING; 

The  Yacc  specification  file  will  contain  the  action  statement: 

$$  =  [titleObJ  str:  $1  ] ; 

In  the  case  of  the  AUTHOR  data  tag,  the  buffer  of  characters  captured  after  detecting  the  Author  tag 
is  passed  to  the  Author  parser  that  has  BNF  specifications  to  handle  the  variations  in  author  names.  The 
author  list  could  be  saved  in  the  Set  class.  The  creation  of  an  Author  object  could  include  an  initialization 
that  would  give  a  wild  card  character  like  for  the  first  or  middle  name  in  cases  where  the  names  are 
missing  from  the  input  stream.  The  methods  defined  for  the  author  class  could  treat  the  names  as  wild 
cards  when  a  match  is  required. 

The  next  logical  development  is  to  define  a  citation  object  that  contains  the  Author,  Title,  and  Date 
Objects  as  a  related  triple. 

extern  id  String,  Set;  id  citationObj; 

citationObj  »  [self  with:  3 
[dateObj  str]; 

[titleObj  str]; 

[authorCjb  str];  ]; 

Methods  could  be  defined  to  create,  add,  delete,  or  modify  a  citation,  in  addition  to  printing  the  citation  in 
"pretty"  forms  for  easy  user  viewing. 


The  prime  idea  in  defining  classes  for  the  heterogeneous  bibliographic  citation  databases  is  to  present  the 
application  programmer  with  abstractions  that  handle  the  data  types  involved,  and  include  all  methods  to 
process  the  abstract  data  types.  Hence  the  objects  are  the  entities  that  are  handled  by  the  application 
programmer  to  reduce  the  details  that  must  be  remembered.  The  particular  class  should  characterize  the 
behavior  of  the  data  entirely.  If  not,  additional  methods  may  be  added  to  the  class  definition.  Indeed,  even 
if  this  is  done,  software  that  has  been  written  based  on  the  former  class  definition  may  not  have  to  be 
rewritten  unless  it  accesses  the  new  features  in  the  class.  The  underlying  physical  structure  of  the  program 
is  taken  care  of  by  the  physical  interfaces  used  by  the  Objective-C  compiler.  The  basic  actions  in 
programming  the  application  are  assignment  statements  that  create  objects  and  invocations  of  class 
methods  through  messages  to  the  objects  to  exhibit  behaviors. 


Chapter  7 


Discussion  and  Future  Directions 


In  recent  years  a  variety  of  powerful  generic  tools  have  been  created.  Database  Management 
SystemsfDBMS)  and  Spreadsheets  are  examples.  They  gain  their  power  from  the  ability  to  operate  on 
various  data.  They  provide  the  generic  operations  of  create,  modify,  and  output  We  have  attempted  to 
create  the  tool  for  data  conversion.  This  study  was  restricted  to  bibliographic  citations  to  see  how  far  the 
idea  of  a  generic  library  tool  can  be  extended.  The  development  of  the  generic  library  tool  requires  the 
definition  of  classes  which  the  application  programmer  incorporates  into  user  software.  The  concept  of 
abstract  data  types  via  classes  can  be  extended  to  Database  Management  Systems.  If  one  considers  the 
relational  model,  then  the  relations  in  the  form  of  tables  can  be  considered  the  data  structure  of  the  class. 
The  operations  of  retrieve,  update,  and  append  with  qualifiers  can  be  considered  the  class  methods.  This 
abstraction  is  a  convenient  one  for  the  application  programmer  since  tables  of  information  are  a  common 
occurrence.  But  a  detail  look  at  the  physical  implementation  of  the  data  structure  may  be  complex.  The 
storage  and  access  mechanisms  may  be  based  on  hashing  algorithms  if  the  data  are  sparse  and  have  a 
balanced  distribution.  B-trees  may  be  used  with  linked  lists  for  fast  searches.  Here  the  user  is  relieved  of 
the  complexities  that  are  left  to  the  Database  Management  System  implementers.  To  access  the  relations 
the  user  relies  on  the  query  language  that  allows  operations  on  the  relations.  In  this  same  regard,  the  person 
developing  the  classes  for  an  Object  Oriented  application  must  provide  the  application  programmer  witli 
the  necessary  classes  to  do  a  job.  The  classes  must  be  general  enough  to  handle  application  programs  that 
have  not  yet  been  defined.  This  is  what  a  good  Database  Management  System  provides,  and  is  what  the 
class  library  for  the  application  should  provide.  Of  course.  Database  Management  Systems  are  always 
being  enhanced  to  do  a  better  job  for  the  user,  and  it  is  expected  that  the  class  library  will  be  improved  with 


lime.  What  is  important  is  that  the  user  will  not  have  to  rewrite  any  software  that  has  been  developed. 


Even  if  the  underlying  physical  structure  is  changed  to  improve  speed  or  space,  the  user  need  not  be 


concerned,  and  all  the  benefiLs  will  be  automatically  gained.  One  can  now  readily  understand  the  strength 


in  using  abstractions.  Through  Object  Oriented  Programming  the  abstraction  mechanism  found  in 


Database  Management  Systems  and  Spreadsheets  can  now  be  extended  to  programming  languages  through 


abstraction  mechanisms  provided  in  languages  like  Smalltalk-80  and  Objective-C. 


This  project  has  demonstrated  the  feasibility  of  establishing  data  consistency  in  heterogeneous 


bibliographic  citation  databases  through  data  abstractions,  called  classes.  Future  work  involves  specifying 


and  implementing  the  full  set  of  classes  for  this  application.  With  the  classes  in  place,  the  application 


programs  can  be  written  to  further  the  data  consistency  goal. 


We  have  discussed  the  bilio-citation  object  consisting  of  the  title,  author,  and  date  objects.  The 


objects  associated  within  the  citation  object  should  be  expanded  to  include  the  necessary  elements  for 


identifying  a  bibliographic  citation.  This  requires  the  establishment  of  a  canonical  form  for  a  bibliographic 


citation.  A  study  of  the  bibliographic  citation  format  from  different  sources  shows  that  the  data  tag  names 


are  diverse  and  many  are  singular.  For  example,  the  DOE/RECON  database  uses  "<PAGE  NO>  17”, 


whereas  the  DTIC7DROLS-TR  has  "<PAGINATION>  30P".  Goldstein  and  Prettyman  have  proposed  a  set 


of  36  fields  for  the  citation  canonical  form  and  it  appears  in  chapter  2.  They  propose  two  character  data 


tags,  such  as  PG  for  the  number  of  pages  in  the  reference.  Their  canonical  form  is  based  on  bibliography 


preparation.  The  data  fields  for  the  general  case  needs  to  be  studied  and  proposed.  On  a  cursory  glance. 


the  expanded  canonical  form  should  include  "AB"  for  abstract,  and  "KW"  for  keyword  descriptors.  We 


note  singular  data  tags  that  probably  are  only  meaningful  to  the  local  bibliographic  database  such  as 


"<LIM1TATI0N  CODES>  1",  can  be  excluded  from  the  canonical  form  of  the  citation.  With  the  data  tag 


and  associated  data  elements  defined  for  die  canonical  form  of  a  bibliographic  citation,  the  definition  of 


classes  for  data  consistency  can  proceed.  The  Date  class  can  be  re-used  for  the  journal  date,  publication 


year,  copyright  year,  and  the  meeting  date.  The  definition  of  a  Location  class  is  appropriate  for  the  meeting 


location,  publication  location,  and  author  location.  Tliis  class  should  access  an  abbreviation  dictionary  to 


produce  a  consistent  form  of  the  location. 


If  the  location  is  listed  as  London,  then  London,  England  should  be  substituted.  The  location  US,  U.S.A., 
or  United  States  should  be  made  consistent  in  die  same  fashion.  Warnings  should  be  included  for  data  not 
found  in  the  dictionary,  so  that  it  may  be  updated  with  new  entries.  The  standardizadon  of  publication 
titles  can  be  added  to  a  Source  class.  Certainly,  the  conversion  for  case  consistency  in  a  character  strings, 
and  the  expansion  of  abbreviadons  should  be  included  in  the  class  methods.  Alternate  names  for  people  or 
insdtutions  could  be  accessible  from  a  dicdonary  to  further  aid  in  data  consistency.  We  note  that  the 
Dictionary  class  is  available  in  Objecdve-C  and  can  be  incorporated  into  an  class. 

A  future  expansion  should  include  the  post-processing  tasks  in  terms  of  the  classes  defined  in  the 
applicadon  tool  library.  Methods  could  be  included  to  "pretty-print  a  bibliographic  citation",  to  analyze 
bibliographic  text,  to  display  the  citations  on  the  CRT  screen,  to  plot  the  statisdcal  informadon  on  a  graph, 
and  to  do  cross-correladons  on  the  data  fields.  The  convenient  tools  of  Unix  can  be  incorporated  into  the 
classes  since  Objecdve-C  is  designed  with  the  use  of  Unix  tools  in  mind.  We  have  seen  how  the  Unix  tools 
Lex  and  Yacc  were  incorporated  into  the  Objective-C  program. 

The  procedure  of  establishing  data  consistency  in  a  heterogeneous  bibliographic  citadon  database 
through  the  definition  of  abstract  data  types  can  be  extended  to  other  heterogeneous  databases.  The 
restriction  is  that  the  informadon  in  the  heterogeneous  databases  derive  from  a  common  base,  as  in 
bibliographic  citadons.  Hence  for  a  relational  database  where  a  reladon  is  employee,  a  field  in  the  reladon 
is  name,  and  its  detail  informadon  is  John  Jones,  the  data  tag  could  be  <employee.name>,  and  the  detail 
field  would  be  John  Jones.  The  existence  of  a  data  tag  and  and  an  associated  detail  field  in  the  database 
establishes  the  reuse  of  the  data  abstracdons  created  for  the  bibliographic  citation  database. 
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Appendix  A 


Hierarchy  of  Objective-C  Classes  -  ©class,  ©phyla  [PPI851[Eag851 
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Ophyla 

Collection 

Primitive 


Dictionary 


Oclaw 

Dicitionary 

IdAmy 

Sequence 

Amoc 

Ophyla 

Collection 

Primitive 


Bag 


Oclau 

Bat 

IdAmy 

Sequence 

IntAmy 

OphylB 

Collection 

Primitive 


Hierarchy  of  Objective-C  Classes  (continued) 


Assoc 

OclMf 

Amoc 

Ophylx 

CcUcctioa 

Pnxzuiiv* 


)Ct 

lui 

Object 

hyU 

Primitive 


Array 


Ocliat 

Amy 

Ophyla 

Primitive 


Point 

Oclast 

Point 

Ophyin 

CcometTy 

Primitivt 

Rectangle 

Qclau 

Rnctanglt 

Point 

OphyU 

Geometry 

Primitive 


Byt  Array 

Qclaee 

BytAmy 

OphyU 

Primitive 

IdArray 

Oclass 

IdAmy 

OphyU 

Primitive 

IntArray 

Oclaes 

IntAmy 

OphyU 

Primitive 


Hierarchy  of  Objective-C  Classes  (continued) 


Appendix  B 


Objective-C  Base  Tree  -  methods  [PPI85][Eag85] 


Object 

frM 

iaitUlis* 

adxVxrSiM 

A«W 

PomAj: 

nadFrom: 


uGnph: 

awakt 

capacity 

ei^ 


compart: 

copy 

dttpCopy 

dttcribt 

dotsNotRtcoffnUt: 

error; 

frtt 

hath 

idOfSTR: 

iiCopyOf: 

iiEqual: 

ttKiadOf: 

iaMtmbtrOf: 

itSamt; 

aamt 

ootEqaal: 

aotlmpitmtnttd 

BotSamt: 

perform: 

perform;  with: 

perform:  with:  with: 

priat 

priatOa: 

priatString: 

reepoadsTo: 

eelf 

•hallowCopy 

•hoaldNotlmpicment 

•how 

•ise 

•toreOn; 

•tr 


AVLTree 

k.y: 

addConttnuTo: 

wldKcytTo: 

find: 

frtt 

inurtlato: 
■•Copy  Of; 
kty 
kty: 

printOn: 


•ubclauRtipontibility 

•uptrClui 
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Objective-C  Base  Tree  (continued) 


I 

K 


Object 

frM 

initUlin 

BdxVuSU* 

MW 

PomA<; 

rMdftom: 

uGnph: 

aw»k« 

cxpacity 

eliM 

eompan: 

copy 

dMpCopjr 

dcterib* 

da«NotR«eo(niM: 

•iron 

frM 

huh 

IdOCTR; 

iiCapyOf: 

iiEqoaJ: 

hKindOf: 

iiM«mb«fO(: 

iiSun*: 

s«mt 

BotEqaab 

aotlmplamtaiad 

aotSama: 

parfona: 

parfonn:  with: 

parfonn:  with:  with: 

priat 

ptiatOa: 

printStriaf: 

raapoadiTo: 

fair 

•hallcwCopj 

ihonidNotlmplamaat 

•how 

•iia 

•toraOn: 

•tr 

•obclutRaapoaubilily 

•uparClua 


Cltn 

aaw 

aaw: 

arith: 

add 

addCoataauOf: 

addContaataTo: 

aaBa( 

aald Array 

aaOrdClta 

aaSat 

<oataia«: 

aaehElamaal 

axpaad 

(ad: 

fraa 

hash 

laCopyOf: 

iaEmpty 

liEqual: 

ohatOf: 

priatCoataBtaOa: 

priatOa: 

laaova: 

raaiovaCoataaUOt; 

•iia 


AVLDict 

add: 

■ddCoataauTo: 
atldAnay 
atKay: 
atKay:  pat: 

.  coataiaa: 

(ad: 

iaCopyOf: 

kayi 

priatCoBtaattOa: 

nmova: 

■Ua 

aaluas 

OrdCltn 

add: 

addCoataauTo: 

at: 

bouadtEnot: 

Sad: 

(adMatchiap: 

(adSTR: 

(ntElaoaat 

UCopyOf; 

laaiQtmaat 

laatladax 

ramova: 

•ixa 


Set 

ftdd: 

addConttnuTo: 

contain*: 

diffcrtnct: 

txpand 

ftJtor: 

find: 

flndEltmontOrNil: 
inUrfoction: 
occarrtncttOf; 
roBOvt: 
rtpiact: 

•»* 
union: 


stack 

add: 

dtpth 

•mptyErr 

itCopyOf: 

lutElament 

pop 

puah; 

•iia 

•wap 


m 


Dictionary 

with: 

aiaociatioaAt: 

atKay: 

alKay;  put: 

iacludtaAuocialioa: 

iaciudaiKay: 

ktyi 

valuaa 


add: 

add:  withOccurraacaa: 

axpaad 

fraa 

iacluda*: 

occamncaaOf: 

priatContaattOa: 

ramova: 


Objective-C  Base  Tree  (continued) 


Object 

frM 

initlaUx* 

ndxVarSUt 

new 

POmA<: 

ratdl^om: 

uCnph: 

»nlw 

capacity 

com  pan: 
copy 

dooaNotRocogniio: 

dnpCopy 

doacribo 

•non 

In* 

hath 

idOISTR: 

bCopyOf: 

ifEqtial: 

bKiodOf; 

iaMemberOf: 

kSamt: 

nam* 

potEqaak 

BOtlmplcmaatad 

BotSamo: 

parfona: 

ptrfonn:  with: 

poifonn:  with:  with: 

print 

printOn: 

printStrinf: 

nfpoadtTo: 

••If 

•how 

•haliowCopy 

•hoaidNotlmpiomant 

•iia 

•tonOn: 

•tr 

•ubclatsRaiponiibility 

•aporClan 


Array 

adnVarSiu 

BdzVarTypo 

•ith: 

••Id  Amy 

boaadtViolatioa: 

capacity 

capacity: 

copy 

d«acribo 

haah 

itCopy; 

iaEqaal: 

priatCoaUntfOa: 

priatOa: 

•iao 

•ort 


BytArray 

ndxVarSiio 

ndxVarTypa 

bow: 

•priatf: 

•tn 

ajlat 
••Float 
••LoBg 
charAt: 
charAt:  pat: 

~  compan: 
companSTR: 
coBcat: 
coBcatSTR: 
deacribt 
hath 

UCopyOf; 

ioEqual; 

UEquaiSTR: 

priatCoaUattOn: 

•OR 

•tr 

•tr: 

IdArray 

adxVarSiio 

ndxVarTypo 

with: 

add: 

addContcntsOf; 

addCoatcnttTo: 

al: 

at;  put: 

contaiiMt 

dttcrib* 

•achElemoat 

Sad; 

fladMatchinf; 

frtoCoatoBM 

haah 

iaEqaai: 

offaotOf; 

offaatMatchini; 

olfiotMatchiafSTR: 

printContentaOa; 

romora; 

•in 

•OR 

IntArray 

odxV»rSise 

ndxVxrTyp* 

describe 

huh 

intAt: 

intAt:  put: 

intAt:  Add: 

iiCopyOf; 

isEquxJ; 

printConicntsOn: 


Objeclive-C  Base  Tree  (continued) 


f 

! 

I 


Object 

frM 

taitUlis* 

BdxVwSin 

PomAk 

mdFtoni: 

uCnph: 

capacity 

clai* 

eompan: 

copy 

dociNotlUcafaiM: 

docpCopy 

dMcribo 

error 

fro* 

hack 

tdOSTR: 

iaCopyOf: 

iaEqul: 

iaKiadOf: 

iaMasibarOf: 

itSania: 

aaffla 

ootEqoal: 

nollmplamaatad 

aotSaoa; 

parfonn: 

porform:  witk: 

porform:  with:  with: 

print 

priatOa: 

priatStrinc: 

raapoadaTo: 

aatf 

ahow 

ahailowCopy 

ahoaldNotlmpicmtat 

aiia 

aloraOn: 

atr 

fabciaaaRaaponiibility 

auparClaaa 


Assoc 

hay: 

hay:  valaa: 

compara: 

kaak 

iaEqaal: 

kay 

hay: 

ptiatOa: 

atr 

aalaa 

aalaa; 

Point 

fromUaar 
a:  y: 

diet: 

dot: 

iaAboaa: 

iaBalow: 

iaCopyOf: 

iaEqnat: 

iaUIt: 

iaRight: 

haah 

laagth 

auaaa: 

movaBy: 

movaBy:  a: 

moaaTo: 

plaa: 

priatOa: 

limaa: 

a 

a: 

a:  y: 

r 

r- 


Objective-C  Base  Tree  (continued) 


Object 

frM 

laitUliM 

ndxVaiSiM 

MW 

PomA*: 

nadfYom: 

atCnph: 

&wxk* 

capacity 

cIm 

compara: 

copy 

dMpCopy 

dtacribo 

doosNotRaeognixa: 

•rror 

&M 

hath 

idOSTR: 

iaCopyOf: 

iaEqoal: 

iaKindOf: 

IiMombtrOt: 

iaSamt: 

namt 

aotEqaai; 

notlmplcmaatad 

notSama: 

parform: 

parfona:  with: 

pariona:  with:  with: 

priat 

priatOn: 

prints  triag: 

raapondaTo: 

aalf 

ahallowCopy  * 

ahouldNotlsipleaiaat 

•how 

aiia 

atoraOa: 

atr 

anbclanRcaponaibility 

faparClaia 


Rectangle 

^mUitr 

a«w 

orifia:  eontn 
origin::  comer:: 
origin:  extent: 
origin::  extent:: 


bottom 

bottom: 

bottomCenter 

bottomLeft 

bottomRight 

center 

ceaterX 

centerY 

conteiac: 

comer 

comer: 

extent 

extent: 

beeh 

height 

height: 

tneetBy:: 

intersection: 

intersects: 

isCopyOf: 

isEqnni: 

left 

left: 

IcftCenter 

moveBy: 

moeeBy:: 

origin 

origin: 

origin:  comer: 
origin:  extent: 
printOn: 
right 

right: 

rightCenter 

top 

top: 

topCentcr 

topLeft 

topRJght 

union: 

width 

width: 


ft' 

w 

w 

w 

i 


I 


Objective-C  Base  Tree  (continued) 


Object 

frM 

ialtuUn 

BdxVkiSiM 

POmAb: 

nBdFrom: 

aiCnph; 

BWBk* 

CBpBcitjr 

cIbm 

eompan: 

copy 

dMpCopy 

duerib* 

ctNotIUcogBiM: 

•rror 

fr** 

hath 

idOlSTR: 

iiCopyOf: 

bEqsal: 

liKindOf: 

itMtmbtrOf: 

US  amt: 

Bamt 

BotEqual: 

BotlmpitmtBltd 

aotSamt: 

ptrfona: 

ptrtonn:  with: 

ptrform:  with:  with: 

priBt 

priatOa: 

priBtStriBf; 

rttpoadiTo: 

ttif 

thallowCopy 
■hould  Notlmpitmtat 

•how 

tut 

•tortOn: 

•tr 

•ubclauRtiponaibility 

•optrClait 


Sequence 

ovtr: 

fint 
-  bM 

UCopyOf: 

But 

onr 

nwiad 

'  String 
adxVarSiu 
adxVarTypt 

BIW 

atw: 

•priatf: 

atR 


aaFloat 

aalat 

atLoBf 

capacity 

capacity: 

char  At: 

charAt:  pot; 

compart: 

comparts  tR: 

coacat; 

coacatSTR: 

copy 

dttcribt 

hath 

UCopyOf: 

UEqual: 

UEqaalSTR: 

priatOa: 

•Ut 

ttr 

•treat: 

Unknown 

adxVarSii 

ndxVarTypt 

BtwClata:  Vara:  onlOD:  Text: 
priatOa: 

capacity 

dttcribt 

dottNotRecopniic: 

iiVarCapacity: 


Appendix  C 

Prototype  Source  Code 


LEX  Specification  File 


t( 

finclude  "objc.h" 
flnclude  "y.tab.h" 

♦define  MON(x)  {  yylval.lex  -  x  ;  return  MONTH  ;  ) 
-  (N.  Collection,  Primitive) 

%1 

%% 

(jj]an("."|uary)?  MON(l)  ; 
tfFleb("."(ruary)?  M0N(2)  ; 


IniMlar("."|ch)?  MONO) 

laMprf.’liin  MONO) 

[mMJay  MON (5) 

IjJ]un(-."|e)?  M0N(6) 

t jJ]ul("."|y)?  M0N(7) 

laAJug("."|u3t)?  M0N(8) 


[3S]op(" I  tember)  ?  MONO)  ; 

(o01ct("."lober)?  MON(IO)  ; 

[nN]ov("."|ember)?  MONdl)  ; 
tdD]ec("."|ember)?  M0N<12)  ; 

[0-9]  (  yylval.lex  »  yytext(0]  -  '0'  ; 

return  DIGIT  ;  ) 

t  )  (  ;  /*  delete  blanJca  */  ) 

"\n"  [  return  EOL  ;  ) 

(  return  EOL  ;  ) 

( 

return  (yytext(O))  ;  )  /»  return  single  cliaracters  */ 

#  include  "3tdio.l>" 

Int  noleap  []  “  ( 

0,  31,  28,  31,  30,  31,  30,  31,  31,  30,  31,  30,  31,  )  ; 

int  leap  ( 1  -  ( 

0,  31,  29,  31,  30,  31,  30,  31,  31,  30,  31,  30,  31,  )  ; 
date  (  month,  day,  year  ) 

< 

int  *day3ln  ; 

days in  -  is leap  <  year  )  1  leap  s  noleap  ; 
if  (month  <  1  I  I  month  >  12  ) 

(  prlntf  (  "month  out  of  range  \n"  )  ; 

return  ; 

) 

if  (  day  <  1  I  I  day  >  daysin [month]  ) 

(  prlntf  (  "day  of  the  month  out  of  rangeXn"  )  ; 
return  ; 


) 

) 

Isleap  (year) 

I  if  (year  94  !-  0  )  return  (0) 

if  (year  *  100  !-  0  )  return  (1) 

if  (year  9  400  !-  0  )  return  (0) 

return  (1) 

1 

int  yywrap  () 

( return (1);  ) 
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YACC  Specincation  File 


/•  stftck  typ«  •/ 

/*  lexical  code  */ 
/•  an  object  •/ 


linclude  *objc.h* 

•  (N,  Collection#  Primitive)  /*  phyla  */ 
extern  id  deteObj  i 

I) 

lunion  i  /*  stack  type  */ 

abort  lex  $  /*  lexical  code  * 

id  obj  I  /*  an  object  */ 

I 

tstert  p'09 
%tokcn<lex>  DIGIT  MONTH 
%token<lex>  COL 
ttype<lex>  nun^r  year  day 
%type<obj>  DateStmt 
tl 

pro9:  DeteStmt  COL  (  exltO;  )  ; 

Datestmt:  MONTH  day  year 

I 

date  (  $1#  $2,  $4  )  i 
$$  •  IdateObJ  mo:  $1  da:  $2  yr:  $4  )  ; 
I  dateObJ  print  )  ; 

1 

I  day  MONTH  year 

I 

date  (  $2#  $1,  $3)  ; 

$$  -  (  daleObj  mo:  42  da:  41  yr:  43  } 

1  dateObJ  print  )  : 

) 

I  number  * /*  number  '/*  number 

I 

if  ($5  <  100)  4S  -  4S  ♦  1900; 
date  (  41#  $3#  $5  )  ; 

$$  «•  I  dateObJ  mo:  41  da:  43  yr:  $S  ) 

1  dateObJ  print  1  t 

) 

I  nuif^r  * 

I 

date  (  X,  1#  $1)  ; 

$4  •  C  dateObJ  mo:  *1  da:  *1  yr:  41  1 
(  datcObJ  print  ]  ; 

1 

i  number 

I 

date  <1#  1#  41)  ; 

$4  •  (  dateObJ  mo:  *1  da:  *1  yr:  41  1 
I  dateObJ  print  )  : 

I 

I  MONTH  nu7rJt>er 

I 

date  i  41#  -1#  42)  ; 

44  •  i  dateObJ  mo:  41  da:  "I  yr:  42)  ; 

I  dateObJ  print  ]  ; 

) 

I  MONTH  '#*  number 

i 

date  <  41#  -1#  43  )  ; 

44  •  I  dateObJ  mo:  41  da:  **1  yr:  43)  : 

I  dateObJ  print  )  ; 

) 


nuTnber:  DIGIT 

I  number  DIGIT 

(  44  •  10  •  41  4  42  ;  } 


flnclude  "stdio.h* 

yyerror  (a)  /*  celled  for  yecc  syntax  error  •/ 

Cher  *a: 

( 

wamin9<a#  (char  •)  0): 

) 

char  ^progname^^stdin"; 

vamln9(s#  tl#  t2#  t3#  t4#  t5#  tS#  t7,  tS#  t9)  /*  print  warning  message  */ 

Chet  •»#  m,  *t2,  •tS#  *t4,  •tS,  ‘te,  *t7,  -tS,  *t9; 

< 

extern  int  yylineno; 

fprintf ietdexr#  *file  %a:  "#  ptogname);  */ 

fprlntf  istderx#  a,  tl#  t2#  t3*  t4#  tS#  t€,  t7#  tB#  t9) ; 

fprlntf (atderc#  *  near  line  %d\n”#  yylineno); 

) 


i 


1^ 


m 


Date  Class  Source  File 


♦include  "objc.h" 

♦include  "y.tab.h” 

char  *  MON[]  =  {  "  ","Jan",  "Feb",  "Mar",  "Apr",  "May",  "Jun", 

"Jul",  "Aug",  "Sep",  "Oct",  "Nov",  "Dec",  )  ; 

“  Date: Object  (  N,  Collection,  Primitive  ) 

{  int  mon,  dy,  year  ;  } 

-  mo: (int)  aMonth  da: (int)  aDay  yr: (int)  aYear 
{  mon  =  aMonth  ; 

dy  =  aDay  ; 
year  =  aYear  ; 
return  self  ; 

1 

-  print 
{ 

if  (dy>0  &&  mon>0  ) 

printf ("<DATE>  %d  %3  %d\n",  dy,  MON(mon],  year  )  ; 

if  (dy  <  0  S&  mon<0  ) 
printf {"<DATE>  %d\n",  year  )  ; 

if  (dy  <  0  4&  mon  >0  ) 

printf  ("<DATE>  %3  %d\n",  MON(mon],  year  )  ; 

/*  insert  code  for  different  type  of  prints  to  account  for  defaults*/ 

) 


Main  Program  Source  File 


♦include  "stdio.h" 

♦include  "objc.h" 

=  (  N,  Collection,  Primitive  ) 
extern  BOOL  msgFlag  ; 
extern  lOD  yyin,  yyout,  msglOD  ; 
id  dateObj  ; 

main  () 

1 


extern  id  Date,  Set  ; 


msglOD  =  stdout; 
msgFlag  =  NO  ; 

dateObj  =  [  Date  new  ]  ; 

printf  (">"  )  ; 
yyparseO  ; 

printf  ("end  yyparse\n") ; 

) 


Sclass  (  Date,  Set,Cltn,  IdArray,  Sequence  ) 
@phyla  (N,  Collection,  Primitive  ) 


Appendix  D 


Merged  File  of  Heterogeneous  Bibliographic  Citations  from  Six 
Database  Sources 


<ACCESSIOM  NO  >  8512902:  8506 

<0aTA6aS£  $0URCE>  BRS/Notionoi  Library  of  Meo<cin«  Dotobese 
<AUTH0R£>  Ell»aon-J-M;  IKhorff-E-A; 

<PAA>  Camoridqe  Hosoitol .  Mas aoc nu ae t t a . 

<TITlE>  More  t bon  o  gateway:  the  role  of  the  emergency  psyehiotry  service 
in  the  community  mentol  health  network 
<PUB  DESC>  Hosp-Commun i t y-Psycn » o t r y .  1985  Eeb.  36(2).  P  180-5. 

<LANGUAGE>  EN. 

<MAJOR  CATEGORY>  COMMON  I TY-MENTAL-MEALTH-CENTERS :  og .  EMERCENC Y-S ER V I CE-HOSP I T A L . 

EMERCENCY-SERVICES-PSYCHI ATRIC  og .  t NT ER I NST 1  TUT  1 ONA L-RE LAT  t ONS .  UENT AL-HE A LTH-SER V  I CES 
og 

<yiNOR  CATEGORY>  ADULT.  BOSTON  CASE-REPORT.  CATCHMENT- AREa-MEALTH 
crisis-intervention.  TEMALE.  MOSPITAL-8eD-CAPAC!TY-300-TO-499.  HUMAN. 
male  MIDOLE-AGE.  role  social-work  or  in  helping  the  emergency  unit 
buiip  closer  relationships  witn  community  agencies  is  its  controct  with 
t ne  stole  to  perform  evaluations  of  oil  odmissions  to  the  stole  hospitoi 
psychiotric  unit  serving  the  cotchment  areo.  The  emergency  unit  performs 
trioge  ana  provides  bockup  for  the  agencies,  coordinotes  the  monogement 
of  multi-ogency  cases,  ond  holds  weekly  educationoi  conferences  for  ogtncy 
staff.  Using  cose  exompies.  the  outhors  illustrote  how  unit  ond  ogency 
staff  collaborate  to  ensure  continuity  of  potient  core.  Author. 

<SB>  M 
<DATE>  1985 
<1SSN>  0022-1597. 

<2N>  21  107 . 567 . 675 . . 

<IM>  8506 
<ED>  850404 

<NO>  MM1 7582  ^  , 

<1tCCESSIOM  NO:>  AI47675  ' 

kOaTaBASE  S0URC£>  OTIC/drol s-t r 

<TRANSLATI0N  0ATE>  Mon  Jul  1  13:33:43  POT  1985  (489098023) 

<OOWNLOAO  OATO  Mon  Jul  1  10:18:29  POT  1985  (469086309) 

<OOWNlOAO  file  NAM£>  gote 

<riEL0S  AND  CROUPS>  17/2 

<ENTRY  CLASSiriCATlON>  UNCLASSIFIED 

<CORPORATE  auThOR>  BOLT  BERANEK  AND  NEWMAN  |NC  CAMBRIDGE  MA 

<title>  pluribus  satelite  imp  development  mobile  access  terminal  network. 

<title  CLASSi ricAT ion>  unclassified 

<0ESCRIPT1V£  N0TE>  OUARTERLY  TECHNICAL  REPT .  NO.  33.  1  FEB-30  APR  84. 

<OATE>  MAY  .  1984 

<PAGINAT 10N>  30P 
<REP0RT  NUM0ER>  6BN-5774 

<C0NTRACT  NUM0ER>  MDA903-80-C-0353 .  N00039-8 1 -€-0400 
CRCPORT  CLASSIFICATION>  UNCLASSIFIED 

<OCSCR|PTORS>  ^SATELLITE  COMMUNICATIONS;  eTERMINALS;  NETWORKS;  SHIPBOARD; 

ACCESS.  MOBILE;  WORK 

<OESCRIPTOR  CLASSIFICATION>  UNCLASSIFIED 

<I0ENT1FIERS>  PLURIBUS  SATELLITE.  PACKET  NETWORKS.  ARPANET.  GATEWAYS 
<lDENTIflER  CLASSIFICATION>  UNCLASSIFIED 

<ABSTRACT>  THIS  OUARTERLY  TECHNICAL  REPORT  DESCRIBES  WORK  ON  THE  DEVELOPMENT 
OF  PLURIBUS  SATELLITE  IMPS:  AND  ON  SHIPBOARD  SATELLITE  COMMUNICATIONS. 

(AUTHOR) 

<ABSTRACT  CLASSIFICATI0N>  UNCLASSIFIED 

<INIT1AL  :nvENTORY>  12 

<LIM1TATI0N  CODES>  1 

<SOURCE  COOE>  060100 

<OOCUM6N7  LOCATION>  NTIS 

<C£OPOLl T ICAL  C00e>  2508 

<TVPE  COC£>  4 

<ACCESSION  NO.>  1103508 

KOATABASE  S0URCE>  dialog  NTIS  FUE  6 

<REPORT  NO  >  <NTIS>  OE850006 1 7/XA8 

<TI7LE>  P os t -P r oc e 9 s I ng  of  6  * b  I  i og r o p h  i  c  Citotions  from  DOE/RECON,  NASA/RECON, 
ond  OOO/OROLS .  Revision  1 

<AUTH0RS>  Ballinger.  W,  A  Hampel,  V  E  ;  Hor  rison,  |  Murphy,  T. 

P 

<PUB  0ESC>  Lowrence  Livermore  National  Lob..  CA .  <Code>  068147000;  9513035  Deportment 
DC.  UCRL-89995-REV  1 .  CONF-84 t 2 «3- 1 -REV . 1 
<0ATE>  Aug  1984 
<PC>  I7p 

<LANGUACE>  Engl  i  sh 

<D0CumEnT  Type>  Conference  proceeding 
<PC>  PC  A02/MF  A01 
<JA>  CRAI8507;  NSAIOOe 
<C0  OF  PUBL>  United  Stotes 

<NT>  I n t e r no t 1 ono I  online  informotion  meeting,  London,  UK.  4  Dec  1964. 

<CN>  W-7405-ENC-48 

<A8STRACT>  We  hove  developed  on  interoctive.  setf— guided  progrom  for  the 
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joint  Do  s  t -0  r  oc  «  s  %  I  nq  of  b  i  D  I  >  i  c  C'lotions  tron  (He  fcderoi  inforMot'Or 

centers  of  the  Otportment  of  Energy  f OOC ' .  tne  Deoortment  of  Defense  (ODD), 
ono  tHe  Nottonol  Aerortoutics  and  Space  Aon  i  n  i  s  t  r  o  t  i  o  n  (NASA).  This  proorom 
is  currently  installed  on  the  Intelligent  Gotevoy  Processor  of  the  Technology 
InforAotion  System  (TlS/iCP)  ot  the  Lowrence  Livermore  Nottonol  Loporotpry 
ono  is  under  evoluotion  by  the  TIS  user  community  from  remote  terminols 
by  telephone  dioi'-up.  over  TYMNCT.  ond  the  ARPA  computer  netmork.  Users 
ore  individuQily  outhorized  for  outomoted  access  to  specific  informotion 
centers,  ono  use  stonoord  eommonds  (or  the  do«ntooo«n9.  compifotion. 
end  online  review  of  citations  in  o  common  format.  Previously  reported 
pos t -p r oe e s s 1 nq  cooobilities  nove  been  further  exponoed.  permitting:  (1) 

online  citotion  review,  co tego r i zo t r on .  ona  oddition  of  new  doto  elements; 

(2)  disossempiy  dno  re-'Otsembly  of  citotions;  (3)  stotisticol  onolysis 
of  doto  field  contents:  (4)  c  r  oss'^cor  re  1  o  1 1  on  of  doto  field  contents: 
ond  (S)  concordance  generotion.  In  oddition.  the  new  two-^poss  interpreter 
for  the  pos t -p r oce ss I ng  program  permits:  the  1  rone f o rmo t < on  of  obbrev«oted 
doto  field  names  into  English  nomee  preferred  by  eoch  ogency.  the  stotisticol 
onolysis  of  the  density  end  completeness  of  doto  fields  in  selected  sets 
of  b I b i i og roph ( c  citotions.  the  eliminotion  of  redundont  citotions  (using 
use r—spec i f i ed  criterio).  ond  trend  onolysis.  The  letter  is  o  powerful 
tool  for  the  exploration  of  t ime^dependent  cho r oe t e r i s t i cs  in  o  portieulor 
field  of  reseorch,  of  on  orgon i 20 t i on .  or  for  on  outhor.  Grophicol  disploye 
of  publicotion  rotes  os  o  function  of  time  ond  the  hormolized  stotistics 
of  terms  used  in  the  description  of  the  work,  con  be  used  to  signol  new 
directions  of  ongoing  reseorch  ond  the  intensity  ot  its  support.  (ERA 
citotion  10:90(706) 

<DESCRIPT0RS>  •Informotion;  •Computer  Networks;  Informotion  Retrievol; 

Spec i f i cot  I ons 

<1ndeKing  Terms>  Cf^OA/dddjed;  NTISOC 

<SH>  S6  (Behoviorol  ond  Socioi  Sc i ences— ‘Document ot i on  ond  Informotion  Technology): 
96  (Electronics  ond  Electrical  Eng  1 neer 1 ng—»Compu ( e r s ) ;  686  (Librory 

ono  Informotion  Sc  i  enees-'*!  nf  ormot  •  on  Systems),  628  (Computers.  Control, 
end  Information  Theory«--Computer  Soltwore) 

^ACCESSION,  H0.>.'^4C».166S56 
<0ATaBASE  SOURCO  OOC/recon 

<TRANSl.AriON  0ATE>  Mon  Jul  1  13:33:43  POT  1085  (489698023) 

^download  0ATE>  Mon  Jul  1  10:18:29  POT  198$  (489686369) 

<OOWNLOAO  FILE  NAME>  gote 

<REP0RT  NO. PACO  UCRL— 8999S-Rev.  t  P.  f  7;  DEd50066  I  7 

<T  I  Tl£(MONO)>  Pos  t*>p  r  oce  ss  I  ng  of  b  i  b  I  i  ogr  oph  i  c  Citotions  from  OOE/RECON, 

NASA/RECON.  end  DOO/OROLS.  Revision  1 
<C0i7OR  OR  C0mP>  Bollinger,  W.A.;  Hompel.  V.£.;  Horrison.  Murphy.  T.P. 

<C0RP0RATE  AUTh>  Lawrence  Livermore  Notionol  Lob.,  CA  (USA) 

<C0RP0RATE  COOE>  9$13e3$ 

<TtPE>  R 

<SEC  REPT  N0>  C0NF-.841243— 1-Rev.  1 
<PACC  N0>  17 

<AVAJLABILITY>  NTIS.  PC  Ae2/MF  A61 
<0R0ER  NUMSERb  0C6$eee6l7 
<C0NTRACT  N0>  Cent  root  «i>74e$>ENG-46 

<C0NF  T1TLE>  8.  i nt e r no t ( one  I  online  informotion  meeting 

dCONF  PLACO  London.  UK 

<C0NF  0ATE>  4  Dee  1984 

<DATE>  Aug  1984 

<CO  OF  AUTH>  US 

<CO  OF  PUBL>  US 

<ANN  J>  eRA-ie:001766;EOB-84: I8a$$5 
<01STRieUT10N>  MN-32 
<OOCUMENT  0R1C1N>  P 
<B1S>  TIC 

<CATEGORICS>  £06-996306 

<PRIMARY  CAT>  E0B-99636e(CENCRAL  AND  MISCELLANEOUS;  INFORMATION  HANDLING) 

<A8STRACT>  Re  hove  developed  on  interoctive.  self-guided  program  (or  the 
joint  bos t-process i ng  of  b • b I  1 og r oph 1 c  citotions  from  the  federgi 
information  centers  of  the  Deportment  of  Energy  (DOE),  the  Deportment  of 
Defense  (000),  end  the  Notionoi  Aeronoutics  ond  Spoce  Adm i n 1 s t r o ( 1  on 
(NASA)  This  progrom  is  currently  inetolied  on  the  intelligent  Cotewoy 
Processor  of  the  Technology  Informotion  System  (TlS/iCP)  ot  the  Lowrenct 
Livermore  Nationol  Loborotory  ond  is  under  evoluotion  by  the  TIS  user 
community  from  remote  terminols  by  telephone  dioi-up.  over  TYMNET,  ond  the 
ARPA  computer  network.  Users  ore  individuolly  outhorired  for  outomoted 
occesa  to  specific  informotion  centers,  ond  use  stonoord  eommonds  for  the 
dewnioo^ing,  compilation,  and  online  review  of  eitot>ons  in  o  common 
format.  Previously  reported  post-processing  copobilities  hove  been  further 
ewponded.  permitting:  (1)  online  citotion  review,  c o t ego r < z 0 1 1  on ,  ond 
Oddition  ot  new  doto  elements;  (2)  disossembiy  ond  re-otsembfy  of 
Citotions;  (3)  stotisticol  onoiysis  of  doto  field  contents;  (4) 

C r os s-c o r r e I o I  I  on  of  doto  field  contents;  ond  ($)  concoroonct  generotioev 
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In  oddition.  the  hew  two'^ooss  interpreter  fpr  tne  do  ft  t  ~p  r  oc  e  s  s  i  n^  prooror 
pernitts;  the  t  r  on  s  f  o  r  mo  t  i  on  of  obbrevioteo  ooto  fieio  nomes  into  enqiisr 
names  preferred  by  eoch  agency,  the  stotisticot  onotyftis  o<  the  oensitv 
ono  c omp I e t e ne 38  of  dato  fields  in  selected  sets  of  b  i & t  i  09 ropn i c 
citotions,  the  eiiminotion  of  redunoont  citot>cns  (ufttng  use r  —  s pec  I f i ed 
criteria),  ond  trend  onolysis.  The  lotter  is  0  powerful  tool  for  the 
exploration  of  t  imo'-dependent  ehoroc  t  er  t  st  i  cs  in  o  porticuior  field  of 
reseorch.  of  on  o r gan  1  jo t  i  on .  or  for  on  outhor.  Crophieol  dispioys  of 
publieotton  rates  os  o  function  of  time  ond  the  normoiized  stotistics  of 
terms  used  in  the  description  of  the  work,  con  be  used  to  sipnoi  new 
directions  of  ongoing  research  and  the  intensity  of  its  support. 

<DESCR 1 PTORS>  •INFORMATION — computer  ne t wo r k s :  I NFORMAT 1  ON  RETRIEVAL. 

SPECIFICATIONS 
<ISSUC>  6423 
<DOCUMENT  N0>  64:186555 
^ACC&SSiON.^.MO^.ii/84C017369/ 

<0ATABASE  S0URC£>  OOE/recoh 

<TRANSLATION  0ATE>  Mon  Jul  1  13:33:43  POT  1965  (469698023) 

<OOWNLOAO  OATE>  Mon  Jul  1  10:16:29  POT  1965  (489086309) 

<OOWNLOAO  FILE  NAME>  gets 

<REP0RT  N0.PAGE>  UCRL — 91383  P  1 0 : OE85001 741 

<T 1TLC(M0N0)>  Integrotion  of  on  outomoted  Mbrory  support  system  with  on 
i n  t  e I  I  i gen t  go t  ewpy 
<eDlTOR  OR  COMP>  Burton,  H.D. 

^CORPORATE  AUTH>  Lawrence  Livermore  Notionol  Lob..  CA  (USA) 

<CORPORATE  COOE>  9513035 
<TYPE>  R 

<<SEC  RCPT  N0>  C0NF-d4e9l3e — 1 
<PAGE  NO>  10 

<AVAILABILITY>  NTIS.  pc  A02/Mr  A01. 

<OROtR  NUMeER>  Oe850ei741 
<C0NTRACT  N0>  ContfflCt  R-7405-ENG-48 

<CONF  TITLO  Integrated  online  librory  systems  conference 

<CONF  PLACE>  Atlanta.  GA.  USA 

<CONF  0ATE>  1 3  Sf p  1 984 

<OATe>  Aug  1964 

<C0  OF  AUTM>  us 

<C0  OF  PU0L>  US 

4ANN  J>  EOB-84  173691 

<OISTRI8UTION>  MN-32 

<OOCUMENT  0R1GIN>  P 

<eis>  T IC 

<CATEG0R1ES>  EOS<'99e30e 

<PRtMARY  CAT>  EO8-99e300(CENERAL  AND  MISCELLANEOUS:  INFORMATION  HANDLING) 
<ABSTRACT>  a  new  project  of  the  Technology  lnformot»on  System  (TIS)  ot  the 
Lowfence  L»vermore  National  Loborotory  (LLNL)  colls  for  the  evoluotion  of 
commercially  ovoiloble  librory  support  pockoges  ond  the  smtension  ond 
integrotion  of  the  most  dssiroble  system  with  the  TIS  gotewoy  to  provide  0 
comp r ehe n s I  V e  p'ototype  for  fibrories  ond  informotion  centers.  This 
prototype  system  is  planned  to  focilitote  occsss  to  ond  monogsmsnt  of 
in-house  activities  such  os  cotdlogirig,  seriois  control,  ond  acquisitions, 
os  well  os  to  interfoce  to  externol  systems  ono  services  for  doto 
dewnlooding  ond  sxchonge,  retrieval,  ond  pos t—p r oc e $ s i ng .  Cooperotive 
cotologing.  distributed  dotobose  processing,  siectronic  i  n  t  e  r<>  I  •  b  r  o  r  y 
loon,  ond  customized  bibliogrophy  production  ore  some  of  the  feotures 
plonned  for  the  prototype.  Development  of  o  user-friendly  front-end 
processor  will  0 1  1 ow  the  user  to  negotiote  his  seorch  query  in  o 
sem I  — ou t omo t ed  monner  using  a  single.  English— like  eommond  language.  The 
TIS  ot  Lowrence  L*vermore  Notionc'  Loborotory  (LLNL)  hoe  developed  o 
c ompu t e r-oo s e d  intelligent  gotewoy  for  outomoted  occcss  to  such  diverse, 
geogroph  •  CO  I  I  y  distributed  informotton  systems  os  DOE/RECON,  DOO/OROLS. 
NASA/RECON.  CAS  On-L«ne.  QARC  (Fror>ce)  ond  OECMtMA  IRest  Germony).  among 
mony  others.  New  informotion  resources  centers  ore  being  added  os  required 
and  users  con  connect  s « mg  1 t oneou s 1 y  to  more  Ihon  one  host  to  compart 
their  doto  The  TIS  online  master  directory  provides  the  user  with  0 
Single,  integroted  view  of  ovoiloble  ond  relevont  resources  The  outomoted 
occess  procedures  permit  tne  user  to  concentrote  on  the  informotion 
ospects  of  his  work  rother  tnon  be  burdened  with  vorious  log-on 
procedures,  dotobose  formots  ond  protocols.  The  merger  of  the  librory 
support  with  (he  TIS  gotewoy  should  provide  users  with  o  copobilities  to 
occess  ono  utilise  the  full  spectrum  of  testuot,  numeric  ond  grophics  doto 
resources 

<OESCR I PTORS>  •information  SYSTEMS — computer  ne t we r k s ; D A T A  BASE  MANAGEMENT: 

LAWRENCE  LIVERMORE  LABORATORY 
<issue>  8421 

CUPPOSTEO  DESC>  MANAGEMENT .NATIONAL  ORGAN  I  2 AT  I ONS : US  AECUS  DOE. US  EROA;US 
ORCANI 7AT IONS 
<D0CumENT  no>  64  173691 
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<ACCtSS10N  NO  >  84N33e99| 

<0ATA8ASC  SOURCC>  NASA/rtCOn 

<TRANSLAT10N  OATE>  Mon  Jut  ^  13:33:43  POT  1985  (489098823) 

<OOWNLOAO  0A7E>  Mon  Jul  1  10:18:29  POT  1985  (489086309) 

<OOWNtOAO  PILE  NAMO  gotc 
<1SSUE>  22 
<PAGC>  3643 
<CATCGORY>  62 

<RPT#>  OCe4.0l32ie  UCRU-90276  C0Nr-64e6 1 39-1 
<CNT|>  VI-7405-CNG-48 
<OATe>  1984 
<PAGES>  122 

<DOC.  CLASS]F.>  UNCLASSIFIED 

<TITLE>  An  onlin*  diroctofy  of  dotobosos  for  moltriel  proportiet 
<AUTHORS>  HAMPEL,  V.  £.;  eOLLlNCER.  W.  A.;  CAYNOR .  C.  A.;  OLOANl.  J.  J. 

<PAA>  C/(Controi  Ooto  Corp.) 

<PU8  OESC>  Colifornio  Un<«.,  Livorikoro.  Lowronco  LtvorAoro  Lob.  CSS: 

(Toehnoiogy  Informotien  Sy«t««.)  AVAIL. NTIS  SAP:  HC  AOS/MF  A01  Prooontod 
ot  th«  9th  Intorn.  COOATA  Conf . .  Jorutolom.  24—28  Jun.  1984 
<DESCR1P70RS>  DATA  BASE  MANAGEMENT  SYSTCMSiOATA  BASES : D 1 RECTOR! ES : I NPORMAT ION 
DISSEMINATION: INFORMATION  SYSTEMS 

<MINS>  /  COMPUTER  NETWORKS/  COMPUTER  TECHNIQUES/  DATA  PROCESSING/  ON-LINE 
SYSTEMS  /  STATISTICAL  ANALYSIS 
<AeA>  DOE 

<A6STRACT>  An  on  I i no  diroctory  of  dotobosoo  of  AOtoriol  preportitt  on  tht 
Technology  InforAotion  Syateo  ot  Lovrenc*  Livernore  Notionol  Loporotory 
(LLNL/TIS)  ie  deteribed.  Thio  directory  it  intended  to  provide  interoctive 
oeceee  to  ecientific  ond  technicol  dotoboeee  ovoiiobie  to  the  public  thot 
contoin  inforeotion  pertoining  to  nucieor.  otoeic,  ooleculor,  phyeicoi. 
chemicol«  ond  Aechonicol  propertiee  of  eubetoncee.  In  oddition  to  the  101 
doto  files  previously  ore  reported.  The  inforisotion  is  updoted  with  isore 
thon  38  nueeric  dotoboses  ond  prediotive  systems  in  these  fields.  In 
oddition  to  describing  the  contents  of  the  dotoboses.  updoted  informotion 
ie  provided  on  the  ovoilobility  of  the  dotoboses  end  their  online  oecess 
over  public  telephone  ond  doto  netmorke.  Some  of  the  numeric  dotoboses  ore 
directly  occessible  by  outhorized  users  vie  the  TIS  Intelligent  ••  Getewoy 
••  Processor  ot  LLNL  (TtS/tCP).  with  self-guiding  procedures  for  the 
downiooding.  merging,  poet-proceeeing.  ond  grophicel/etot i it ieei  enelysis 
of  doto. 

^database  S0URCE>  dialog  NTIS  FILE  6 
<REPORT  N0.>  <NTIS>  OE84ei3210 

<T1TLE>  Online  Directory  of  Dotoboses  for  Moterlol  Properties 
<auTh0RS>  Hompei.  V.  E.  ;  Bollinger.  W.  A.  ;  Goynor.  C.  A.  ;  Oldeni,  j. 

J  . 

<PUS  0ESC>  Lowfence  Livermore  Notionol  Lob..  CA.  <Code>  068147000;  9513035  ;  Deportment 
DC.  ;  UCRL-90276;  CONF-8406> 39-1 

<0ArE>  Moy  1904 
<PC>  122p 

<AV>  Portions  ore  illegible  in  microfiche  products. 

<LANGUAGE>  English 

<OOCUMENT  TYPC>  Conference  proceeding 
<PC>  PC  A06/Mr  A91 
<JA>  GPAie423;  NSA09O0 
<C0  OF  PU6L>  United  Stotes 

<NT>  1  nt e r not i one  I  COOATA  conference.  Jerusolern.  Isroel.  24  Jun  1904. 

<CN>  W-7405-ENG-48 

<ABSTRnCT>  We  hove  crested  on  online  directory  of  dotoboses  of  moteriol 
properties  on  the  Technology  Informotion  System  ot  Lowrence  Livermore 
Notionol  Loborotory  (LLNL/TIS).  This  directory  is  intended  to  provide 
interoctive  occess  to  scientific  ond  technicol  dotoboses  ovoiiobie  to 
the  puolic  thot  contoin  informotion  pertoining  to  nucieor.  otomic.  moleeulor. 
physicol,  ehemtcoi.  ond  mechonicol  properties  of  substances.  The  directory 
is  boseo  on  worii  dene  eorlier  by  Joseph  Hilsenroth  of  the  Notionol  Bureau 
of  Stondords  (NBS/OSRO)  and  Jock  H  Westbrook  of  Generol  Electric  Corporotion. 

In  oddition  to  the  101  doto  files  previously  reported,  we  hove  updoted 
the  informotion  ond  identifted  more  thon  38  new  numeric  dotoboses  ond 
predictive  systems  in  these  fields.  We  hove  included,  where  opplicoble. 
entries  conto>ned  in  the  directories  published  by  Cuodro  Associotes, 
codata,  ond  UNESCO.  In  oddition  to  describing  the  contents  of  the  dotoboses, 
we  hove  provided  updoted  informotion  on  the  ovoilobility  of  the  dotoboses 
end  their  online  occess  over  public  telephone  ond  doto  networks.  The 
online  directory  is  prepored  for  use  by  scientists  and  engineers  ond 
Should  enhonce  the  shoring  of  S  ond  T  resources  over  commun i c o t i on  networks. 

This  directory  is  SKpecled  to  becomo  borticulorly  importent  to  the  notionol 
ond  i n t e r no t I ono I  mognetic-  ond  loser-onorgy  fusion  projects,  nucieor 
criticolity  sofety,  ond  eomputor  oided  engineering  progroms.  Some  of 
the  numeric  dotoboses  ore  directly  occessible  by  authorized  users  vio 
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d«scriD«s  th«  oppiications  which  hov*  h*Aefit*d  from  ARPANET  during  (he 
roporting  period.  P i no  I  I y .  it  ditcusses  on  i nves i i got i on  of  the  lecnniguct 
for  focdimiie  tronamiesion  between  different  devteee  over  the  networn. 
Corner  work  in  ottoeking  hotia  by  front-end  techniques  hot  been  broodeneo 
to  provide  ••  gotewoy  ••  facilities  between  computer  networks.  Here, 
pursued  were  two  lines.  An  interne  twork  Tronsnisston  Control  Protocol  TCP 
hos  been  implemented  which  is  designed  to  be  opplicobic  to  o  host-host 
protocol  between  hosts  on  different  networks.  Experiments  to  test  the 
properties  of  this  protocol  nove  been  storted  between  UCL.  Stonford  U  ond 
Bolt.  Beronek  ond  Newmon  B6N.  More  effort  hos  been  put  into  designing  ond 
implementing  ••  gotewoy  ••  functions  when  o  specific  node  octs  os  o  •• 
gotewoy  ••  between  two  networks  ond  performs  o  mopping  between  the 
stondord  protocols  of  eoch.  Investigoted  wos  the  opp I i cob i i i t y  of  this 
opprooeh  to  severol  networks,  including  the  connection  oo  ARPANET  ond  the 
UK  Post  Office  Ckperimentol  Pocket  Switched  Service  CPSS.  Preliminory 
results  show  thot  the  technique  should  be  feosible.  but  since  the  other 
networks  ore  not  yet  operotionol.  the  technique  wos  not  demons t r o t ed . 
rfinnrii7inf>~ttni,iM 

<OATABASC  SOURCt>  SOC/UibroTy  ond  Info 

<TRANSLATION  OATO  Mon  Jul  1  13:46:50  POT  1985  (469698456) 

<OOWNLOAO  0ATC>  Mon  Jul  1  16:56:46  POT  1985  (489666246) 
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<0ATe>  1976 
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